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Description 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention 

[0001] The present invention relates to novel polynucleotides derived from microorganisms belonging to coryneform 
bacteria and fragments thereof, polypeptides encoded by the polynucleotides and fragments thereof, polynucleotide 
arrays comprising the polynucleotides and fragments thereof, computer readable recording media in which the nucle- 
ic otide sequences of the polynucleotide and fragments thereof have been recorded, and use of them as well as a method 
of using the polynucleotide and/or polypeptide sequence information to make comparisons. 

2. Brief Description of the Background Art 

15 [0002] Coryneform bacteria are used in producing various useful substances, such as amino acids, nucleic acids, 
vitamins, saccharides (for example, ribulose), organic acids (for example, pyruvic acid), and analogues of the above- 
described substances (for example, N-acetylamino acids) and are very useful microorganisms industrially. Many mu- 
tants thereof are known. 

[0003] For example, Corynebacterium glutamicum is a Gram-positive bacterium identified as a glutamic acid-pro- 
20 ducing bacterium, and many amino acids are produced by mutants thereof. For example, 1 ,000,000 ton/year of L- 
glutamic acid which is useful as a seasoning for umami (delicious taste), 250,000 ton/year of L-lysine which is a valuable 
additive for livestock feeds and the like, and several hundred ton/year or more of other amino acids, such as L-arginine, 
L-proline, L-glutamine, L -tryptophan, and the like, have been produced in the world (Nikkei Bio Yearbook 99, published 
by Nikkei BP (1998)). 

25 [0004] The production of amino acids by Corynebacterium glutamicum is mainly carried out by its mutants (metabolic 
mutants) which have a mutated metabolic pathway and regulatory systems. In general, an organism is provided with 
various metabolic regulatory systems so as not to produce more amino acids than it needs. In the biosynthesis of L- 
lysine, for example, a microorganism belonging to the genus Corynebacterium is under such regulation as preventing 
the excessive production by concerted inhibition by lysine and threonine against the activity of a biosynthesis enzyme 

30 common to lysine, threonine and methionine, i.e., an aspartokinase, (J. Biochem., 65: 849-859 (1969)). The biosyn- 
thesis of arginine is controlled by repressing the expression of its biosynthesis gene by arginine so as not to biosyn- 
thesize an excessive amount of arginine (Microbiology, 142: 99-108 (1996)). It is considered that these metabolic 
regulatory mechanisms are deregulated in amino acid-producing mutants. Similarly, the metabolic regulation is dereg- 
ulated in mutants producing nucleic acids, vitamins, saccharides, organic acids and analogues of the above-described 

35 substances so as to improve the productivity of the objective product. 

[0005] However, accumulation of basic genetic, biochemical and molecular biological data on coryneform bacteria 
is insufficient in comparison with Escherichia coli, Bacillus subtiiis f and the like. Also, few findings have been obtained 
on mutated genes in amino acid-producing mutants. Thus, there are various mechanisms, which are still unknown, of 
regulating the growth and metabolism of these microorganisms. 

40 [0006] A chromosomal physical map of Corynebacterium glutamicum ATCC 13032 is reported and it is known that 
its genome size is about 3,1 00 kb (/Wo/. Gen. Genet., 252: 255-265 (1 996)). Calculating on the basis of the usual gene 
density of bacteria, it is presumed that about 3,000 genes are present in this genome of about 3,100 kb. However, only 
about 100 genes mainly concerning amino acid biosynthesis genes are known in Corynebacterium glutamicum, and 
the nucleotide sequences of most genes have not been clarified hitherto. 

45 [0007] In recent years, the full nucleotide sequence of the genomes of several microorganisms, such as Escherichia 
coli, Mycobacterium tuberculosis, yeast, and the like, have been determined (Science, 277: 1453-62 (1997); Nature, 
393: 537-544 (1998); Nature, 387: 5-105 (1997)). Based on the thus determined full nucleotide sequences, assumption 
of gene regions and prediction of their function by comparison with the nucleotide sequences of known genes have 
been carried out. Thus, the functions of a great number of genes have been presumed, without genetic, biochemical 

so or molecular biological experiments. 

[0008] In recent years, moreover, techniques for monitoring expression levels of a great number of genes simulta- 
neously or detecting mutations, using DNA chips, DNA arrays or the like in which a partial nucleic acid fragment of a 
gene or a partial nucleic acid fragment in genomic DNA other than a gene is fixed to a solid support, have been 
developed. The techniques contribute to the analysis of microorganisms, such as yeasts, Mycobacterium tuberculosis, 

55 Mycobacterium bovis used in BCG vaccines, and the like (Science, 278: 680-686 (1 997); Proc. Nati Acad. Sci. USA, 
96: 12833-38 (1999); Science, 284: 1520-23 (1999)). 
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SUMMARY OF THE INVENTION 

[0009] An object of the present invention is to provide a polynucleotide and a polypeptide derived from a microor- 
ganism of coryneform bacteria which are industrially useful, sequence information of the polynucleotide and the 
5 polypeptide, a method for analyzing the microorganism, an apparatus and a system for use in the analysis, and a 
method for breeding the microorganism. 

[0010] The present invention provides a polynucleotide and an oligonucleotide derived from a microorganism be- 
longing to coryneform bacteria, oligonucleotide arrays to which the polynucleotides and the oligonucleotides are fixed, 
a polypeptide encoded by the polynucleotide, an antibody which recognizes the polypeptide, polypeptide arrays to 
10 which the polypeptides or the antibodies are fixed, a computer readable recording medium in which the nucleotide 
sequences of the polynucleotide and the oligonucleotide and the amino acid sequence of the polypeptide have been 
recorded, and a system based on the computer using the recording medium as well as a method of using the polynu- 
cleotide and/or polypeptide sequence information to make comparisons. 

15 BRIEF DESCRIPTION OF THE DRAWING 

[001 1] Fig. 1 is a map showing the positions of typical genes on the genome of Corynebacterium glutamicum ATCC 
13032. 

[0012] Fig. 2 is electrophoresis showing the results of proteome analyses using proteins derived from (A) Coryne- 
20 bacterium glutamicum ATCC 13032, (B) FERM BP-7134, and (C) FERM BP-158. 

[0013] Fig. 3 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

[0014] Fig. 4 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

[0015] This application is based on Japanese applications No. HeL 11-377484 filed on December 16, 1999 : No. 
2000-159162 filed on April 7, 2000 and No. 2000-280988 filed on August 3, 2000, the entire contents of which are 

30 incorporated hereinto by reference. 

[0016] From the viewpoint that the determination of the full nucleotide sequence of Corynebacterium glutamicum 
would make it possible to specify gene regions which had not been previously identified, to determine the function of 
an unknown gene derived from the microorganism through comparison with nucleotide sequences of known genes 
and amino acid sequences of known genes, and to obtain a useful mutant based on the presumption of the metabolic 

35 regulatory mechanism of a useful product by the microorganism, the inventors conducted intensive studies and, as a 
result, found that the complete genome sequence of Corynebacterium glutamicum can be determined by applying the 
whole genome shotgun method. 

[0017] Specifically, the present invention relates to the following (1) to (65): 
40 (1 ) A method for at least one of the following: 

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium, 

(B) measuring an expression amount of a gene derived from a coryneform bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 
45 (D) analyzing expression patterns of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a coryneform bacterium, 
said method comprising: 

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected 
50 from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any 

one of SEQ ID NOS:1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 
stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of 
the first or second polynucleotides, 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
55 ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 

labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 
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As used herein, for example, the at least two polynucleotides can be at least two of the first polynu- 
cleotides, at least two of the second polynucleotides, at least two of the third polynucleotides, or at least 
two of the first, second and third polynucleotides. 

5 (2) The method according to (1), wherein the coryneform bacterium is a microorganism belonging to the genus 

Corynebacterium, the genus Brevibactehum, or the genus Microbactehum. 

(3) The method according to (2), wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium callunae, Corynebacterium hercuiis, Corynebacterium lilium, Corynebacteri- 

10 urn metassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(4) The method according to (1 ), wherein the polynucleotide derived from a coryneform bacterium, the polynuce- 
lotide derived from a mutant of the coryneform bacterium or the polynucleotide to be examined is a gene relating 
to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid, and analogues thereof. 

15 (5) The method according to (1 ), wherein the polynucleotide to be examined is derived from Escherichia coil 

(6) A polynucleotide array, comprising: 

at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucle- 
otide sequence represented by any one of SEQ ID NOS:1 to 3501 , second polynucleotides which hybridize 
20 with the first polynucleotides under stringent conditions, and third polynucleotides comprising 1 0 to 200 con- 

tinuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 

As used herein, for example, the at least two polynucleotides can be at least two of the first polynucleotides, 
25 at least two of the second polynucleotides, at least two of the third polynucleotides, or at least two of the first, 

second and third polynucleotides. 

(7) A polynucleotide comprising the nucleotide sequence represented by SEQ ID NO:1 or a polynucleotide having 
a homology of at least 80% with the polynucleotide. 

(8) A polynucleotide comprising any one of the nucleotide sequences represented by SEQ ID NOS:2 to 3431 , or 
30 a polynucleotide which hybridizes with the polynucleotide under stringent conditions. 

(9) A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID 
NOS:3502 to 6931 , or a polynucleotide which hybridizes therewith under stringent conditions. 

(10) A polynucleotide which is present in the 5' upstream or 3' downstream of a polynucleotide comprising the 
nucleotide sequence of any one of SEQ ID NOS:2 to 3431 in a whole polynucleotide comprising the nucleotide 

35 sequence represented by SEQ ID NO:1 , and has an activity of regulating an expression of the polynucleotide. 

(11) A polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequence of the polynucleotide of 
any one of (7) to (1 0), or a polynucleotide comprising a nucleotide sequence complementary to the polynucleotide 
comprising 10 to 200 continuous based. 

(12) A recombinant DNA comprising the polynucleotide of any one of (8) to (11). 

40 (13) A transformant comprising the polynucleotide of any one of (8) to (11) or the recombinant DNA of (12). 

(14) A method for producing a polypeptide, comprising: 

culturing the transformant of (13) in a medium to produce and accumulate a polypeptide encoded by the 
polynucleotide of (8) or (9) in the medium, and 
45 recovering the polypeptide from the medium. 

(1 5) A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 
and analogues thereof, comprising: 

so culturing the transformant of (13) in a medium to produce and accumulate at least one of an amino acid, a 

nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and 
recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, 
and analogues thereof from the medium. 

55 (1 6) A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS: 

2 to 3431 . 

(17) A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931 . 

(18) The polypeptide according to (16) or (17), wherein at least one amino acid is deleted, replaced, inserted or 
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added, said polypeptides having an activity which is substantially the same as that of the polypeptide without said 
at least one amino acid deletion, replacement, insertion or addition. 

(19) A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid 
sequence of the polypeptide of (16) or (17), and having an activity which is substantially the same as that of the 

5 polypeptide. 

(20) An antibody which recognizes the polypeptide of any one of (16) to (19). 

(21) A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of (16) to (19) and 
10 partial fragment polypeptides of the polypeptides, and 

a solid support adhered thereto. 

(22) A polypeptide array, comprising: 

15 at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypep- 

tides of (16) to (19) and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

(23) A system based on a compuler for identifying a target sequence or a target structure molif derived from a 
20 coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:1 
to 3501 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

25 (iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 

1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence information which is coincident with or analogous to- the 
target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

30 

(24) A method based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 , target se- 
35 quence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with 
the target sequence or target structure motif information; and 

(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
40 target sequence or target structure motif information. 

(25) A system based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

45 (») a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 

3502 to 7001 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage 

50 device for screening and analyzing amino acid sequence information which is coincident with or analogous to 

the target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

(26) A method based on a computer for identifying a target sequence or a target structure motif derived from a 
55 coryneform bacterium : comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , and target 
sequence information or target structure motif information into a user input device; 
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(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target sequence or target structure motif information; and 

(iv) screening and analyzing amino acid sequence information which is coincident with or analogous to the 
5 target sequence or target structure motif information. 

(27) A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having 
a target nucleotide sequence derived from a coryneform bacterium, comprising the following: 

10 (i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:2 

to 3501, function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide 
sequence information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
15 2 to 3501 with the target nucleotide sequence information , and determining a function of a polypeptide encoded 

by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501 ; and 

(iv) an output devices that shows a function obtained by the comparator. 

20 (28) A method based on a computer for determining a function of a polypeptide encoded by a polypeptide encoded 

by a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the 
following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 , function in- 
25 formation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 with 
the target nucleotide sequence information; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
30 which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 

from SEQ ID NOS:2 to 3501 . 

(29) A system based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

35 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target amino acid sequence infor- 
mation; 

(ii) a data storing device for at least temporarily storing the input information; 

40 (iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 

3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 
one amino acid sequence selected from SEQ ID NOS:3502 to 7001 ; and 
(iv) an output device that shows a function obtained by the comparator. 

45 

(30) A method based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , function 
so information based on the amino acid sequence, and target amino acid sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target amino acid sequence information; and 

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or 
55 analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 

7001. 

(31) The system according to any one of (23), (25), (27) and (29), wherein a coryneform bacterium is a microor- 
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ganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

(32) The method according to any one of (24), (26), (28) and (30), wherein a coryneform bacterium is a microor- 
ganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

(33) The system according to (31 ), wherein the microorganism belonging to the genus Corynebacterium \s selected 
5 from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophitum, Corynebacterium 

acetoglutamicum, corynebacterium callunae, corynebacterium herculis, Corynebacterium lilium, Corynebacterium 
meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(34) The method according to (32), wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 

10 acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterh 

um meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(35) A recording medium or storage device which is readable by a computer in which at least one nucleotide 
sequence information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide se- 
quence is recorded, and is usable in the system of (23) or (27) or the method of (24) or (28). 

15 (36) A recording medium or storage device which is readable by a computer in which at least one amino acid 

sequence information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid 
sequence is recorded, and is usable in the system of (25) or (29) or the method of (26) or (30). 

(37) The recording medium or storage device according to 

(35) or (36), which is a computer readable recording medium selected from the group consisting of a floppy disc, 
20 a hard disc, a magnetic tape : a random access memory (RAM), a read only memory (ROM), a magneto-optic disc 

(MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM and DVD-RW. 

(38) A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the 
Val residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform 
bacterium is replaced with an amino acid residue other than a Val residue. 

25 (39) A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino 

acid sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

(40) The polypeptide according to (38) or (39), wherein the Val residue at the 59th position is replaced with an Ala 
residue. 

(41) A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro 
30 residue at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform 

bacterium is replaced with an amino acid residue other than a Pro residue. 

(42) A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino 
acid sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

(43) The polypeptide according to (41 ) or (42), wherein the Pro residue at the 458th position is replaced with a Ser 
35 residue. 

(44) The polypeptide according to any one of (38) to (43), which is derived from Corynebacterium glutamicum. 

(45) A DNA encoding the polypeptide of any one of (38) to (44). 

(46) A recombinant DNA comprising the DNA of (45). 

(47) A transformant comprising the recombinant DNA of (46). 

40 (48) A transformant comprising in its chromosome the DNA of (45). 

(49) The transformant according to (47) or (48), which is derived from a coryneform bacterium. 

(50) The transformant according to (49), which is derived from Corynebacterium glutamicum. 

(51 ) A method for producing L-lysine, comprising: 

45 culturing the transformant of any one of (47) to (50) in a medium to produce and accumulate L-lysine in the 

medium, and 

recovering the L-lysine from the culture. 

(52) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
so SEQ ID NOS:1 to 3431 , comprising the following: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 

55 method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
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bacterium obtained in (iii). 

(53) The method according to (52), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
a signal transmission pathway. 

5 (54) The method according to (52), wherein the mutation point is a mutation point relating to a useful mutation 

which improves or stabilizes the productivity. 

(55) A method for breading a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:1 to 3431 , comprising: 

w (i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 

rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 
(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

15 (jii) deleting a mutation point from a coryneform bacterium having the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 

(56) The method according to (55), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
20 a signal transmission pathway. 

(57) The method according to (55), wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

(58) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:2 to 3431 , comprising the following: 

25 

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 
quence information represented by SEQ ID NOS:2 to 3431 ; 

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity; 

30 (iii) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bac- 
terium which have been transformed with the gene obtained in (iii). 

(59) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
35 SEQ ID NOS:2 to 3431 , comprising the following: 

(i) arranging a function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431 ; 

(ii) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission 
pathway; 

40 (jii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium 

in combination with information relating known biosynthesis pathway or signal transmission pathway of a co- 
ryneform bacterium; 

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either 
45 strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or 

weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 

(60) A coryneform bacterium, bred by the method of any one of (52) to (59). 

(61) The coryneform bacterium according to (60), which is a microorganism belonging to the genus Corynebac- 
50 terium, the genus Brevibacterium, or the genus Microbacterium. 

(62) The coryneform bacterium according to (61), wherein the microorganism belonging to the genus Corynebac- 
terium is selected from the group consisting of Coryncbactorium glutamicum, Corynebacterium acetoacidophilum, 
Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lil- 
ium, Corynebacterium meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

55 (63) A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a 

saccharide, an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of (60) to (62) in a medium to produce and accumulate at least 
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one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof; 

recovering the compound from the culture. 

(64) The method according to (63), wherein the compound is L-lysine. 

(65) A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the 
following: 

(i) preparing 

a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
jected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 



(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(Hi) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; 

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase 
20 to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

25 As used herein, the term "proteome", which is a coined word by combining "protein" with "genome", refers to 

a method for examining of a gene at the polypeptide level. 

(66) The method according to (65) : wherein the coryneform bacterium is a microorganism belonging to the genus 
Corynebacterium, the genus Brevibactehum, or the genus Microbacterium. 

(67) The method according to (66) , wherein the microorganism belonging to the genus Corynebacterium is selected 
30 from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 

acetogiutamicum, Corynebacterium callunae, corynebacterium herculis, Corynebacterium til ium Corynebacterium 
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(68) A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382). 

35 [0018] The present invention will be described below in more detail, based on the determination of the full nucleotide 
sequence of coryneform bacteria. 

1 . Determination of full nucleotide sequence of coryneform bacteria 

40 [0019] The term "coryneform bacteria" as used herein means a microorganism belonging to the genus Corynebac- 
terium, the genus Brevibacterium or the genus Microbacterium as defined in Bergeys Manual of Determinative Bacte- 
riology 8: 599 (1974). 

[0020] Examples include Corynebacterium acetoacidophilum, Corynebacterium acetogiutamicum, Corynebacterium 
callunae, Corynebacterium glutamicum, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melas- 
secola, Corynebacterium thermoaminogenes, Brevibacterium saccharolyticum, Brevibacterium immariophilum, Brevi- 
bacterium roseum, Brevibacterium thiogenitalis, Microbacterium ammoniaphilum, and the like. 

[0021 ] Specific examples include Corynebacterium acetoacidophilum ATCC 1 3870, Corynebacterium acetogiutami- 
cum ATCC 15806, Corynebacterium callunae ATCC 15991, Corynebacterium glutamicum ATCC 13032, Corynebac- 
terium glutamicum ATCC 13060, Corynebacterium glutamicum ATCC 13826 (prior genus and species: Brevibacterium 
ftavum, or Corynebacterium lactofermentum), Corynebacterium glutamicum ATCC 14020 (prior genus and species: 
Brevibacterium divaricatum), Corynebacterium glutamicum ATCC 13869 (prior genus and species: Brevibacterium 
lactofermentum), Corynebacterium herculis ATCC 13868, Corynebacterium lilium ATCC 15990, Corynebacterium 
melassecola ATCC 17965, Corynebacterium thermoaminogenes FERM 9244, Brevibacterium saccharolyticum ATCC 
14066, Brevibacterium immariophilum ATCC 14068, Brevibacterium roseum ATCC 13825, Brevibacterium thiogenitalis 
ATCC 1 9240, Microbacterium ammoniaphilum ATCC 1 5354, and the like. 
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(1) Preparation of genome DNA of coryneform bacteria 

[0022] Coryneform bacteria can be cultured by a conventional method. 

[0023] Any of a natural medium and a synthetic medium can be used, so long as it is a medium suitable for efficient 
5 culturing of the microorganism, and it contains a carbon source, a nitrogen source, an inorganic salt, and the like which 
can be assimilated by the microorganism. 

[0024] In Corynebacterium glutamicum, for example, a BY medium (7 g/l meat extract, 1 0 g/l peptone, 3 g/l sodium 
chloride, 5 g/l yeast extract, pH 7.2) containing 1% of glycine and the like can be used. The culturing is carried out at 
25 to 35°C overnight. 

10 [0025] After the completion of the culture, the cells are recovered from the culture by centrifugation. The resulting 
cells are washed with a washing solution. 

[0026] Examples of the washing solution include STE buffer (1 0.3% sucrose, 25 mmol/l Tris hydrochloride, 25 mmol/ 
I ethylenediaminetetraacetic acid (hereinafter referred to as "EDTA"), pH 8.0), and the like. 

[0027] Genome DNA can be obtained from the washed cells according to a conventional method for obtaining ge- 
15 nome DNA, namely, lysing the cell wall of the cells using a lysozyme and a surfactant (SDS, etc.), eliminating proteins 
and the like using a phenol solution and a phenol/chloroform solution, and then precipitating the genome DNA with 
ethanol or the like. Specifically, the following method can be illustrated. 

[0028] The washed cells are suspended in a washing solution containing 5 to 20 mg/l lysozyme. After shaking : 5 to 
20% SDS is added to lyse the cells. In usual, shaking is gently performed al 25 to 40°C for 30 minutes to 2 hours. After 
20 shaking, the suspension is maintained at 60 to 70°C for 5 to 1 5 minutes for the lysis. 

[0029] After the lysis, the suspension is cooled to ordinary temperature, and 5 to 20 ml of Tris-neutralized phenol is 
added thereto : followed by gently shaking at room temperature for 15 to 45 minutes. 

[0030] After shaking, centrifugation (15,000 x g, 20 minutes, 20°C) is carried out to fractionate the aqueous layer. 
[0031] After performing extraction with phenol/chloroform andextraction with chloroform (twice) in the same manner, 

25 3 mol/l sodium acetate solution (pH 5.2) and isopropanol are added to the aqueous layer at 1/10 times volume and 2 
times volume, of the aqueous layer, respectively, followed by gently stirring to precipitate the genome DNA. 
[0032] The genome DNA is dissolved again in a buffer containing 0.01 to 0.04 mg/ml RNase. As an example of the 
buffer, TE buffer (10 mmol/l Tris hydrochloride, 1 mol/l EDTA, pH 8.0) can be used. After dissolving, the resultant 
solution is maintained at 25 to 40°C for 20 to 50 minutes and then extracted successively with phenol, phenol/chloroform 

30 and chloroform as in the above case. 

[0033] After the extraction, isopropanol precipitation is carried out and the resulting DNA precipitate is washed with 
70% ethanol, followed by air drying, and then dissolved in TE buffer to obtain a genome DNA solution. 

(2) Production of shotgun library 

35 

[0034] A method for produce a genome DNA library using the genome DNA of the coryneform bacteria prepared in 
the above (1 ) include a method described in Molecular Cloning, A laboratory Manual, Second Edition (1989) (hereinafter 
referred to as "Molecular Cloning, 2nd ed"). In particular, the following method can be exemplified to prepare a genome 
DNA library appropriately usable in determining the full nucleotide sequence by the shotgun method. 
40 [0035] To 0.01 mg of the genome DNA of the coryneform bacteria prepared in the above (1) , a buffer, such as TE 
buffer or the like, is added to give a total volume of 0.4 ml. Then, the genome DNA is digested into fragments of 1 to 
10 kb with a sonicator (Yamato Powersonic Model 50). The treatment with the sonicator is performed at an output of 
20 continuously for 5 seconds. 

[0036] The resulting genome DNA fragments are blunt-ended using DNA blunting kit (manufactured by Takara Shuzo) 
45 or the like. 

[0037] The blunt-ended genome fragments are fractionated by agarose gel or poly aery lamide gel electrophoresis 
and genome fragments of 1 to 2 kb are cut out from the gel. 

[0038] To the gel, 0.2 to 0.5 ml of a buffer for eluting DNA, such as MG elution buffer (0.5 mol/l ammonium acetate, 
10 mmol/l magnesium acetate, 1 mmol/l EDTA, 0.1% SDS) or the like, is added, followed by shaking at 25 to 40°C 
50 overnight to elute DNA. 

[0039] The resulting DNA eluate is treated with phenol/chloroform and then precipitated with ethanol to obtain a 
genome library insert. 

[0040] This insert is ligated into a suitable vector, such as pUC1 8 Smal/SAP (manufactured by Amersham Pharmacia 
Biotech) or the like, using T4 ligase (manufactured by Takara Shuzo) or the like. The ligation can be carried out by 
55 allowing a mixture to stand at 1 0 to 20°C for 20 to 50 hours. 

[0041] The resulting ligation product is precipitated with ethanol and dissolved in 5 to 20 uJ of TE buffer. 

[0042] Escherichia coli is transformed in accordance with a conventional method using 0.5 to 2 ul of the ligation 

solution. Examples of the transformation method include the electroporation method using ELECTRO MAX DHIOB 
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(manufactured by Life Technologies) for Escherichia coii. The electroporation method can be carried out under the 
conditions as described in the manufacturer's instructions. 

[0043] The transformed Escherichia coli is spread on a suitable selection medium containing agar, for example, LB 
plate medium containing 10 to 100 mg/l ampicillin (LB medium (10 g/l bactotrypton, 5 g/l yeast extract, 10 g/l sodium 
5 chloride, pH 7.0) containing 1 .6% of agar) when pUC18 is used as the cloning vector, and cultured therein. 

[0044] The transformant can be obtained as colonies formed on the plate medium. In this step, it is possible to select 
the transformant having the recombinant DNA containing the genome DNA as white colonies by adding X-gal and 
IPTG (isopropyl-p-thiogalactopyranoside) to the plate medium. 

[0045] The transformant is allowed to stand for culturing in a 96-well titer plate to which 0.05 ml of the LB medium 
10 containing 0.1 mg/ml of ampicillin has been added in each well. The resulting culture can be used in an experiment of 
(4) described below. Also, the culture solution can be stored at -80°C by adding 0.05 ml per well of the LB medium 
containing 20% glycerol to the culture solution, followed by mixing, and the stored culture solution can be used at any 
time. 

is (3) Production of cosmid library 

[0046] The genome DNA (0.1 mg) of the coryneform bacteria prepared in the above (1) is partially digested with a 
restriction enzyme, such as Sau3A\ or the like, and then ultracentrifuged (26,000 rpm, 18 hours, 20°C) under a 10 to 
40% sucrose density gradient using a 10% sucrose buffer (1 mol/l Nacl, 20 mmol/l Tris hydrochloride, 5 mmol/l EDTA, 
20 10% sucrose, pH 8.0) and a 40% sucrose buffer (elevating the concentration of the 10% sucrose buffer to 40%). 

[0047] After the centrifugation : the thus separated solution is fractionated into tubes in 1 ml per each tube. After 
confirming the DNA fragment size of each fraction by agarose gel electrophoresis, a fraction rich in DNA fragments of 
about 40 kb is precipitated with ethanol. 

[0048] The resulting DNA fragment is ligated to a cosmid vector having a cohesive end which can be ligated to the 
25 fragment. When the genome DNA is partially digested with Sau3AI, the partially digested product can be ligated to, 
for example, the BamH\ site of superCosI (manufactured by Stratagene) in accordance with the manufacture's instruc- 
tions. 

[0049] The resulting ligation product is packaged using a packaging extract which can be prepared by a method 
described in Molecular Cloning, 2nd ed. and then used in transforming Escherichia coli. More specifically, the ligation 
30 product is packaged using, for example, a commercially available packaging extract, Gigapack III Gold Packaging 
Extract (manufactured by Stratagene) in accordance- with the manufacture's instructions and then introduced into Es- 
cherichia coli XL-1 -BlueMR (manufactured by Stratagene) or the like. 

[0050] The thus transformed Escherichia coli is spread on an LB plate medium containing ampicillin, and cultured 
therein. 

35 [0051] The transformant can be obtained as colonies formed on the plate medium. 

[0052] The transformant is subjected to standing culture in a 96-well titer plate to which 0.05 ml of the LB medium 
containing 0.1 mg/ml ampicillin has been added. 

[0053] The resulting culture can be employed in an experiment of (4) described below. Also, the culture solution can 
be stored at -80°C by adding 0.05 ml per well of the LB medium containing 20% glycerol to the culture solution, followed 
40 by mixing, and the stored culture solution can be used at any time. 

(4) Determination of nucleotide sequence 

(4-1 ) Preparation of template 

45 

[0054] The full nucleotide sequence of genome DNA of coryneform bacteria can be determined basically according 
to the whole genome shotgun method (Science, 269: 496-512 (1995)). 

[0055] The template used in the whole genome shotgun method can be prepared by PCR using the library prepared 
in the above (2) (DNA Research, 5: 1-9 (1998)). 
50 [0056] Specifically, the template can be prepared as follows. 

[0057] The clone derived from the whole genome shotgun library is inoculated by using a replicator (manufactured 
by GEN ETIX) into each well of a 96-well plate to which 0.08 ml per well of the LB medium containing 0.1 mg/ml ampicillin 
has been added, followed by stationarily culturing at 37°C overnight. 

[0058] Next, the culture solution is transported, using a copy plate (manufactured by Tokken), into each well of a 
55 96-well reaction plate (manufactured by PE Biosystems) to which 0.025 ml per well of a PCR reaction solution has 
been added using TaKaRa Ex Taq (manufactured by Takara Shuzo). Then, PCR is carried out in accordance with the 
protocol by Makino et ai (DNA Research, 5: 1-9 (1998)) using GeneAmp PCR System 9700 (manufactured by PE 
Biosystems) to amplify the inserted fragments. 
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[0059] The excessive primers and nucleotides are eliminated using a kit for purifying a PCR product, and the product 
is used as the template in the sequencing reaction. 

[0060] It is also possible to determine the nucleotide sequence using a double-stranded DNA plasmid as a template. 

[0061] The double-stranded DNA plasmid used as the template can be obtained by the following method. 
5 [0062] The clone derived from the whole genome shotgun library is inoculated into each well of a 24- or 96-well plate 

to which 1 .5 ml per well of a 2 x YT medium (16 g/l bactotrypton, 10 g/l yeast extract, 5 g/l sodium chloride, pH 7.0) 

containing 0.05 mg/ml ampicillin has been added, followed by culturing under shaking at 37°C overnight. 

[0063] The double-stranded DNA plasmid can be prepared from the culture solution using an automatic plasmid 

preparing machine KURABO PI-50 (manufactured by Kurabo Industries), a multiscreen (manufactured by Millipore) 
10 or the like, according to each protocol. 

[0064] To purify the plasmid, Biomek 2000 manufactured by Beckman Coulter and the like can be used. 

[0065] The resulting purified double-stranded DNA plasmid is dissolved in water to give a concentration of about 0.1 

mg/ml. Then, it can be used as the template in sequencing. 

15 (4-2) Sequencing reaction 

[0066] The sequencing reaction can be carried out according to a commercially available sequence kit or the like. A 
specific method is exemplified below. 

[0067] To 6 u.1 of a solution of ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured 
20 by PE Biosystems), 1 to 2 pmol of an M13 regular direction primer (M13-21) or an M13 reverse direction primer 
(MI3REV) (DNA Research, 5. 1-9 (1998)) and 50 to 200 ng of the template prepared in the above (4-1) (the PCR 
product or plasmid) to give 10 uJ of a sequencing reaction solution. 

[0068] A dye terminator sequencing reaction (35 to 55 cycles) is carried out using this reaction solution and GeneAmp 
PCR System 9700 (manufactured by PE Biosystems) or the like. The cycle parameter can be determined in accordance 
25 with a commercially available kit, for example, the manufacture's instructions attached with ABI PRISM Big Dye Ter- 
minator Cycle Sequencing Ready Reaction Kit. 

[0069] The sample can be purified using a commercially available product, such as Multi Screen HV plate (manu- 
factured by Millipore) or the like, according to the manufacture's instructions. 

[0070] The thus purified reaction product is precipitated with ethanol, dried and then used for the analysis. The dried 
30 reaction product can be stored in the dark at -30°C and the stored reaction product can be used at any time. 

[0071] The dried reaction product can be analyzed using a commercially available sequencer and an analyzer ac- 
cording to the manufacture's instructions. 

[0072] Examples of the commercially available sequencer include ABI PRISM 377 DNA Sequencer (manufactured 
by PE Biosystems). Example of the analyzer include ABI PRISM 3700 DNA Analyzer (manufactured by PE Biosystems). 

35 

(5) Assembly 

[0073] A software, such as phred (The University of Washington) or the like, can be used as base call for use in 
analyzing the sequence information obtained in the above (4). A software, such as Cross_Match (The University of 
40 Washington) or SPS Cross_Match (manufactured by Southwest Parallel Software) or the like, can be used to mask 
the vector sequence information. 

[0074] For the assembly, a software, such as phrap (The University of Washington), SPS phrap (manufactured by 
Southwest Parallel Software) or the like, can be used. 

[0075] In the above, analysis and output of the results thereof, a computer such as UNIX, PC, Macintosh, and the 
45 |jke can be used. 

[0076] Contig obtained by the assembly can be analyzed using a graphical editor such as consed (The University 
of Washington) or the like. 

[0077] It is also possible to perform a series of the operations from the base call to the assembly in a lump using a 
script phredPhrap attached to the consed. 
50 [0078] As used herein, software will be understood to also be referred to as a comparator. 

(6) Determination of nucleotide sequence in gap part 

[0079] Each of the cosmids in the cosmid library constructed in the above (3) is prepared in the same manner as in 
55 the preparation of the double-stranded DNA plasmid described in the above (4-1 ). The nucleotide sequence at the end 
of the insert fragment of the cosmid is determined using a commercially available kit, such as ABI PRISM BigDye 
Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems) according to the manufacture's 
instructions. 
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[0080] About 800 cosmid clones are sequenced at both ends of the inserted fragment to detect a nucleotide sequence 
in the contig derived from the shotgun sequencing obtained in (5) which is coincident with the sequence. Thus, the 
chain linkage between respective cosmid clones and respective contigs are clarified, and mutual alignment is carried 
out. Furthermore, the results are compared with known physical maps to map the cosmids and the contigs. In case of 
5 Corynebacterium glutamicum ATCC 13032, a physical map of Mol. Gen. Genet, 252:. 255-265 (1996) can be used. 
[0081] The sequence in the region which cannot be covered with the contigs (gap part) can be determined by the 
following method. 

[0082] Clones containing sequences positioned at the ends of the contigs are selected. Among these, aclone wherein 
only one end of the inserted fragment has been determined is selected and the sequence at the opposite end of the 
10 inserted fragment is determined. 

[0083] A shotgun library clone or a cosmid clone derived therefrom containing the sequences at the respective ends 
of the inserted fragments in the two contigs is identified and the full nucleotide sequence of the inserted fragment of 
the clone is determined. 

[0084] According to this method, the nucleotide sequence of the gap part can be determined. 
is [0085] When no shotgun library clone or cosmid clone covering the gap part is available, primers complementary to 

the end sequences of the two different contigs are prepared and the DNA fragment in the gap part is amplified. Then, 

sequencing is performed by the primer walking method using the amplified DNA fragment as a template or by the 

shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment is determined. 

Thus, the nucleotide sequence of the above-described region can be determined. 
20 [0086] In a region showing a low sequence accuracy, primers are synthesized using AUTOFINISH function and 

NAVIGATING function of consed (The University of Washington), and the sequence is determined by the primer walking 

method to improve the sequence accuracy. 

[0087] Examples of the thus determined nucleotide sequence of the full genome include the full nucleotide sequence 
of genome of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID NO:1 . 

25 

(7) Determination of nucleotide sequence of microorganism genome DNA using the nucleotide sequence represented 
by SEQ ID NO:1 

[0088] A nucleotide sequence of a polynucleotide having a homology of 80% or more with the full nucleotide sequence 
30 of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID NO:1 as determined above can also be deter- 
mined using the nucleotide sequence represented by SEQ ID NO:1, and the polynucleotide having a nucleotide se- 
quence having a homology of 80% or more with the nucleotide sequence represented by SEQ ID NO:1 of the present 
invention is within the scope of the present invention. The term "polynucleotide having a nucleotide sequence having 
a homology of 80% or more with the nucleotide sequence represented by SEQ ID NO:1 of the present invention" is a 
35 polynucleotide in which a full nucleotide sequence of the chromosome DNA can be determined using as a primer an 
oligonucleotide composed of continuous 5 to 50 nucleotides in the nucleotide sequence represented by SEQ ID NO: 

1 , for example, according to PCR using the chromosome DNA as a template. A particularly preferred primer in deter- 
mination of the full nucleotide sequence is an oligonucleotide having nucleotide sequences which are positioned at 
the interval of about 300 to 500 bp, and among such oligonucleotides, an oligonucleotide having a nucleotide sequence 

40 selected from DNAs encoding a protein relating to a main metabolic pathway is particularly preferred. The polynucle- 
otide in which the full nucleotide sequence of the chromosome DNA can be determined using the oligonucleotide 
includes polynucleotides constituting a chromosome DNA derived from a microorganism belonging to coryneform bac- 
teria. Such a polynucleotide is preferably a polynucleotide constituting chromosome DNA derived from a microorganism 
belonging to the genus Corynebacterium, more preferably a polynucleotide constituting a chromosome DNA of Co- 

^5 rynebacterium glutamicum. 

2. Identification of ORF (open reading frame) and expression regulatory fragment and determination of the function of 
ORF 

so [0089] Based on the full nucleotide sequence data of the genome derived from coryneform bacteria. determined in 
the above item 1, an ORF and an expression modulating fragment can be identified. Furthermore, the function of the 
thus determined ORF can be determined. 

[0090] The ORF means a continuous region in the nucleotide sequence of mRNA which can be translated as an 
amino acid sequence to mature to a protein. A region of the DNA coding for the ORF of mRNA is also called ORF. 
55 [0091] The expression modulating fragment (hereinafter referred to as "EMF") is used herein to define a series of 
polynucleotide fragments which modulate the expression of the ORF or another sequence ligated operatably thereto. 
The expression "modulate the expression of a sequence ligated operatably" is used herein to refer to changes in the 
expression of a sequence due to the presence of the EMF. Examples of the EMF include a promoter, an operator, an 
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enhancer, a silencer, a ribosome-binding sequence, a transcriptional termination sequence, and the like. In coryneform 
bacteria, an EMF is usually present in an intergenic segment (a fragment positioned between two genes; about 10 to 
200 nucleotides in length). Accordingly, an EMF is frequently present in an intergenic segment of 10 nucleotides or 
longer. It is also possible to determine or discover the presence of an EMF by using known EMF sequences as a target 
5 sequence or a target structural motif (or a target motif) using an appropriate software or comparator, such as FASTA 
(Proc. Natl. Acad Sci. USA, 85: 2444-48 (1988)), BLAST (J. Mol. Biol., 275:403-410 (1990)) or the like. Also, it can 
be identified and evaluated using a known EMF-capturing vector (for example, pKK232-8; manufactured by Amersham 
Pharmacia Biotech). 

[0092] The term "target sequence" is used herein to refer to a nucleotide sequence composed of 6 or more nucle- 
10 otides, an amino acid sequence composed of 2 or more amino acids, or a nucleotide sequence encoding this amino 
acid sequence composed of 2 or more amino acids. A longer target sequence appears at random in a data base at 
the lower possibility. The target sequence is preferably about 1 0 to 1 00 amino acid residues or about 30 to 300 nucle- 
otide residues. 

[0093] The term "target structural motif" or "target motif" is used herein to refer to a sequence or a combination of 
75 sequences selected optionally and reasonably. Such a motif is selected on the basis of the threedimensional structure 
formed by the folding of a polypeptide by means known to one of ordinary skill in the art. Various motives are known. 
[0094] Examples of the target motif of a polypeptide include, but are not limited to, an enzyme activity site, a protein- 
protein interaction site, a signal sequence, and the like. Examples of the target motif of a nucleic acid include a promoter 
sequence, a transcriptional regulatory factor binding sequence, a hair pin structure, and the like. 
20 [0095] Examples of highly useful EMF include a high-expression promoter, an inducible-expression promoter, and 
the like. Such an EMF can be obtained by positionally determining the nucleotide sequence of a gene which is known 
or expected as achieving high expression (for example, ribosomal RNA gene: GenBank Accession No. M16175 or 
Z46753) or a gene showing a desired induction pattern (for example, isocitrate lyase gene induced by acetic acid: 
Japanese Published Unexamined Patent Application No. 56782/93) via the alignment with the full genome nucleotide 
25 sequence determined in the above item 1 , and isolating the genome fragment in the upstream part (usually 200 to 500 
nucleotides from the translation initiation site). It is also possible to obtain a highly useful EMF by selecting an EMF 
showing a high expression efficiency or a desired induction pattern from among promoters captured by the EMF- 
capturing vector as described above. 

[0096] The ORF can be identified by extracting characteristics common to individual ORFs, constructing a general 
30 model based on these characteristics, and measuring the conformity of the subject sequence with the model. In the 
identification, a software, such as GeneMark (Nuc. Acids. Res., 22: 4756-67 (1994): manufactured by GenePro)), 
GeneMark.hmm (manufactured by GenePro), GeneHacker (Protein, Nucleic Acid and Enzyme, 42: 3001-07 (1997)), 
Glimmer (Nuc. Acids. Res., 26: 544-548 (1998): manufactured by The Institute of Genomic Research), or the like, can 
be used. In using the software, the default (initial setting) parameters are usually used, though the parameters can be 
35 optionally changed. 

[0097] In the above-described comparisons, a computer, such as UNIX, PC, Macintosh, or the like, can be used. 
[0098] Examples of the ORF determined by the method of the present invention include ORFs having the nucleotide 
sequences represented by SEQ ID NOS:2 to 3501 present in the genome of Corynebacterium glutamicum as repre- 
sented by SEQ ID NO:1 . In these ORFs, polypeptides having the amino acid sequences represented by SEQ ID NOS: 

40 3502 to 7001 are encoded. 

[0099] The function of an ORF can be determined by comparing the identified amino acid sequence of the ORF with 
known homologous sequences using a homology searching software or comparator, such as BLAST, FAST, Smith & 
Waterman (Meth. Enzym., 164: 765 (1 988)) or the like on an amino acid data base, such as Swith-Prot, PIR, GenBank- 
nr-aa, GenPept constituted by protein-encoding domains derived from GenBank data base, OWL or the like. 

45 [0100] Furthermore, by the homology searching, the identity and similarity with the amino acid sequences of known 
proteins can also be analyzed. 

[0101] With respect of the term "identity" used herein, where two polypeptides each having 10 amino acids are 
different in the positions of 3 amino acids, these polypeptides have an identity of 70% with each other. In case wherein 
one of the different 3 amino acids is analogue (for example, leucine and isoleucine), these polypeptides have a similarity 
50 of 80%. 

[0102] As a specific example, Table 1 shows the registration numbers in known data bases of sequences which are 
judged as having the highest similarity with the nucleotide sequence of the ORF derived from Corynebacterium glutami- 
cum ATCC 1 3032, genes of these sequences, functions of these genes, and identities thereof compared with known 
amino acid translation sequences. 
55 [0103] Thus, a great number of novel genes derived from coryneform bacteria can be identified by determining the 
full nucleotide sequence of the genome derived from coryneform bacterium by the means of the present invention. 
Moreover, the function of the proteins encoded by these genes can be determined. Since coryneform bacteria are 
industrially highly useful microorganisms, many of the identified genes are industrially useful. 
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[0104] Moreover, the characteristics of respective microorganisms can be clarified by classifying the functions thus 
determined. As a result, valuable information in breeding is obtained. 

[0105] Furthermore, from the ORF information derived from coryneform bacteria, the ORF corresponding to the 
microorganism is prepared and obtained according to the general method as disclosed in Molecular Cloning, 2nd ed. 

5 or the like. Specifically, an oligonucleotide having a nucleotide sequence adjacent to the ORF is synthesized, and the 
ORF can be isolated and obtained using the oligonucleotide as a primer and a chromosome DNA derived from co- 
ryneform bacteria as a template according to the general PCR cloning technique. Thus obtained ORF sequences 
include polynucleotides comprising the nucleotide sequence represented by any one of SEQ ID NOS:2 to 3501 . 
[0106] The ORF or primer can be prepared using a polypeptide synthesizer based on the above sequence informa- 

fo tion. 

[0107] Examples of the polynucleotide of the present invention include a polynucleotide containing the nucleotide 
sequence of the ORF obtained in the above, and a polynucleotide which hybridizes with the polynucleotide under 
stringent conditions. 

[0108] The polynucleotide of the present invention can be a single-stranded DNA, a double-stranded DNA and a 

15 single-stranded RNA, though it is not limited thereto. 

[0109] The polynucleotide which hybridizes with the polynucleotide containing the nucleotide sequence of the ORF 
obtained in the above under stringent conditions includes a degenerated mutant of the ORF. A degenerated mutant is 
a polynucleotide fragment having a nucleotide sequence which is different from the sequence of the ORF of the present 
invention which encodes the same amino acid sequence by degeneracy of a gene code. 

20 [0110] Specific examples include a polynucleotide comprising the nucleotide sequence represented by any one of 
SEQ ID NOS:2 to 3431, and a polynucleotide which hybridizes with the polynucleotide under stringent conditions. 
[0111] A polynucleotide which hybridizes under stringent conditions is a polynucleotide obtained by colony hybridi- 
zation, plaque hybridization, Southern blot hybridization or the like using, as a probe, the polynucleotide having the 
nucleotide sequence of the ORF identified in the above. Specific examples include a polynucleotide which can be 

25 identified by carrying out hybridization at 65°C in the presence of 0.7-1 .0 M NaCI using a filter on which a polynucleotide 
prepared from colonies or plaques is immobilized, and then washing the filter with 0.1 x to 2x SSC solution (the com- 
position of Ix SSC contains 150 mM sodium chloride and 15 mM sodium citrate) at 65°C. 

[01 12] The hybridization can be carried out in accordance with known methods described in, for example, Molecular 
Cloning, 2nd ed., Current Protocols in Molecular Biology, DNA Cloning 1: Core Techniques, A Practical Approach, 

30 Second Edition, Oxford University (1995) or the like. Specific examples of the polynucleotide which can be hybridized 
include a DNA having a homology of 60% or more, preferably 80% or more, and particularly preferably 95% or more, 
with the nucleotide sequence represented by any one of SEQ ID NO:2 to 3431 when calculated using default (initial 
setting) parameters of a homology searching software, such as BLAST, FASTA, Smith-Waterman or the like. 
[01 1 3] Also, the polynucleotide of the present invention includes a polynucleotide encoding a polypeptide comprising 

35 the amino acid sequence represented by any one of SEQ ID NOS:3502 to 6931 and a polynucleotide which hybridizes 
with the polynucleotide under stringent conditions. 

[0114] Furthermore, the polynucleotide of the present invention includes a polynucleotide which is present in the 5' 
upstream or 3' downstream region of a polynucleotide comprising the nucleotide sequence of any one of SEQ ID NOS: 
2 to 3431 in a polynucleotide comprising the nucleotide sequence represented by SEQ ID NO:1 , and has an activity 
to of regulating an expression of a polypeptide encoded by the polynucleotide. Specific examples of the polynucleotide 
having an activity of regulating an expression of a polypeptide encoded by the polynucleotide includes a polynucleotide 
encoding the above described EMF, such as a promoter, an operator, an enhancer, a silencer, a ribosome-binding 
sequence, a transcriptional termination sequence, and the like. 

[0115] The primer used for obtaining the ORF according to the above PCR cloning technique includes an oligonu- 
^5 cleotide comprising a sequence which is the same as a sequence of 1 0 to 200 continuous nucleotides in the nucleotide 
sequence of the ORF and an adjacent region or an oligonucleotide comprising a sequence which is complementary 
to the oligonucleotide. Specific examples include an oligonucleotide comprising a sequence which is the same as a 
sequence of 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID NOS:1 
to 3431 , and an oligonucleotide comprising a sequence complementary to the oligonucleotide comprising a sequence 
50 of at least 10 to 20 continuous nucleotide of any one of SEQ ID NOS:1 to 3431 . When the primers are used as a sense 
primer and an antisense primer, the above-described oligonucleotides in which melting temperature (T m ) and the 
number of nucleotides are not significantly different from each other are preferred. 

[0116] The oligonucleotide of the present invention includes an oligonucleotide comprising a sequence which is the 
same as 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID NOS:1 to 
55 3431 or an oligonucleotide comprising a sequence complementary to the oligonucleotide. 

[0117] Also, analogues of these oligonucleotides (hereinafter also referred to as "analogous oligonucleotides 0 ) are 
also provided by the present invention and are useful in the methods described herein. 

[0118] Examples of the analogous oligonucleotides include analogous oligonucleotides in which a phosphodiester 
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bond in an oligonucleotide is converted to a phosphorothioate bond, analogous oligonucleotides in which a phosphodi- 
ester bond in an oligonucleotide is converted to an N3'-P5' phosphoamidate bond, analogous oligonucleotides in which 
ribose and a phosphodiester bond in an oligonucleotide is converted to a peptide nucleic acid bond, analogous oligo- 
nucleotides in which uracil in an oligonucleotide is replaced with C-5 propynyluracil, analogous oligonucleotides in 

5 which uracil in an oligonucleotide is replaced with C-5 thiazoluracil, analogous oligonucleotides in which cytosine in 
an oligonucleotide is replaced with C-5 propynylcytosine, analogous oligonucleotides in which cytosine in an oligonu- 
cleotide is replaced with phenoxazine-modified cytosine, analogous oligonucleotides in which ribose in an oligonucle- 
otide is replaced with a'-O-propylribose, analogous oligonucleotides in which ribose in an oligonucleotide is replaced 
with a'-methoxyethoxyribose, and the like (Celt Engineering, 16: 1463 (1997)). 

10 [0119] The above oligonucleotides and analogous oligonucleotides of the present invention can be used as probes 
for hybridization and antisense nucleic acids described below in addition to as primers. 

[0120] Examples of a primer for the antisense nucleic acid techniques known in the art include an oligonucleotide 
which hybridizes the oligonucleotide of the present invention under stringent conditions and has an activity regulating 
expression of the polypeptide encoded by the polynucleotide, in addition to the above oligonucleotide. 

15 

3. Determination of isozymes 

[0121] Many mutants of coryneform bacteria which are useful in the production of useful substances, such as amino 
acids, nucleic acids, vitamins, saccharides, organic acids, and the like, are obtained by the present invention. 

20 [0122] However, since the gene sequence data of the microorganism has been, to date, insufficient, useful mutants 
have been obtained by mutagenic techniques using a mutagen, such as nitrosoguanidine (NTG) or the like. 
[0123] Although genes can be mutated randomly by the mutagenic method using the above-described mutagen, all 
genes encoding respective isozymes having similar properties relating to the metabolism of intermediates cannot be 
mutated. In the mutagenic method using a mutagen, genes are mutated randomly. Accordingly, harmful mutations 

25 worsening culture characteristics, such as delay in growth, accelerated foaming, and the like, might be imparted at a 
great frequency, in a random manner. 

[0124] However, if gene sequence information is available, such as is provided by the present invention, it is possible 
to mutate all of the genes encoding target isozymes. In this case, harmful mutations may be avoided and the target 
mutation can be incorporated. 

30 [0125] Namely, an accurate number and sequence information of the target isozymes in coryneform bacteria can be 
obtained based on the ORF data obtained in the above item 2. By using the sequence information, all of the target 
isozyme genes can be mutated into genes having the desired properties by, for example, the site-specific mutagenesis 
method described in Molecular Cloning, 2nd ed. to obtain useful mutants having elevated productivity of useful sub- 
stances. 

35 

4. Clarification or determination of biosynthesis pathway and signal transmission pathway 

[0126] Attempts have been made to elucidate biosynthesis pathways and signal transmission pathways in a number 
of organisms, and many findings have been reported. However, there are many unknown aspects of coryneform bac- 
40 teria since a number of genes have not been identified so far. 

[0127] These unknown points can be clarified by the following method. 

[0128] The functional information of ORF derived from coryneform bacteria as identified by the method of above item 
2 is arranged. The term "arranged" means that the ORF is classified based on the biosynthesis pathway of a substance 
or the signal transmission pathway to which the ORF belongs using known information according to the functional 
45 information. Next, the arranged ORF sequence information is compared with enzymes on the biosynthesis pathways 
or signal transmission pathways of other known organisms. The resulting information is combined with known data on 
coryneform bacteria. Thus, the biosynthesis pathways and signal transmission pathways in coryneform bacteria, which 
have been unknown so far, can be determined. 

[0129] As a result that these pathways which have been unknown or unclear hitherto are clarified, a useful mutant 
50 for producing a target useful substance can be efficiently obtained. 

[0130] When the thus clarified pathway is judged as important in the synthesis of a useful product, a useful mutant 
can be obtained by selecting a mutant wherein this pathway has been strengthened. Also, when the thus clarified 
pathway is judged as not important in the biosynthesis of the target useful product, a useful mutant can be obtained 
by selecting a mutant wherein the utilization frequency of this pathway is lowered. 

55 

5. Clarification or determination of useful mutation point 

[0131] Many useful mutants of coryneform bacteria which are suitable for the production of useful substances, such 
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as amino acids, nucleic acids, vitamins, saccharides, organic acids, and the like, have been obtained. However, it is 
hardly known which mutation point is imparted to a gene to improve the productivity. 

[0132] However mutation points contained in production strains can be identified by comparing desired sequences 
of the genome DNA of the production strains obtained from coryneform bacteria by the mutagenic technique with the 
5 nucleotide sequences of the corresponding genome DNA and ORF derived from coryneform bacteria determined by 
the methods of the above items 1 and 2 and analyzing them 

[0133] Moreover, effective mutation points contributing to the production can be easily specified from among these 
mutation points on the basis of known information relating to the metabolic pathways, the metabolic regulatory mech- 
anisms, the structure activity correlation of enzymes, and the like. 
10 [0134] When any efficient mutation can be hardly specified based on known data, the mutation points thus identified 
can be introduced into a wild strain of coryneform bacteria or a production strain free of the mutation. Then, it is examined 
whether or not any positive effect can be achieved on the production. 

[0135] For example, by comparing the nucleotide sequence of homoserine dehydrogenase gene horn of a lysine- 
producing B-6 strain of Corynebacterium gfutamicum (Appl. Microbiol. Biotechnol., 32: 269-273 (1989)) with the nu- 
15 cleotide sequence corresponding to the genome of Corynebacterium gfutamicum ATCC 1 3032 according to the present 
invention, a mutation of amino acid replacement in which valine at the 59-position is replaced with alanine (Val59Ala) 
was identified. A strain obtained by introducing this mutation into the ATCC 13032 strain by the gene replacement 
method can produce lysine, which indicates that this mutation is an effective mutation contributing to the production 
of lysine. 

20 [0136] Similarly, by comparing the nucleotide sequence of pyruvate carboxylase gene pyc of the B-6 strain with the 
nucleotide sequence corresponding to the ATCC 1 3032 genome, a mutation of amino acid replacement in which proline 
at the 458-position was replaced with serine (Pro458Ser) was identified. A strain obtained by introducing this mutation 
into a lysine-producing strain of No. 58 (FERM BP-7134) of Corynebacterium glutamicum free of this mutation shows 
an improved lysine productivity in comparison with the No. 58 strain, which indicates that this mutation is an effective 

25 mutation contributing to the production of lysine. 

[0137] In addition, a mutation A1a213Thr in glucose-6-phosphate dehydrogenase was specified as an effective mu- 
tation relating to the production of lysine by detecting glucose-6-phosphate dehydrogenase gene zwloi the B-6 strain. 
[0138] Furthermore, the lysine-productivity of Corynebacterium glutamicum was improved by replacing the base at 
the 932-position of aspartokinase gene lysC of the Corynebacterium glutamicum ATCC 13032 genome with cytosine 

30 to thereby replace threonine at the 311 -position by isoleucine ! which indicates that this mutation is an effective mutation 
contributing to the production of lysine. 

[01 39] Also, as another method to examine whether or not the identified mutation point is an effective mutation, there 
is a method in which the mutation possessed by the lysine-producing strain is returned to the sequence of a wild type 
strain by the gene replacement method and whether or not it has a negative influence on the lysine productivity. For 
35 example, when the amino acid replacement mutation Val59Ala possessed by horn of the lysine-producing B-6 strain 
was returned to a wild type amino acid sequence, the lysine productivity was lowered in comparison with the B-6 strain. 
Thus, it was found that this mutation is an effective mutation contributing to the production of lysine. 
[0140] Effective mutation points can be more efficiently and comprehensively extracted by combining, if needed, the 
DNA array analysis or proteome analysis described below. 

40 

6. Method of breeding industrially advantageous production strain 



[0141] It has been a general practice to construct production strains, which are used industrially in the fermentation 
production of the target useful substances, such as amino acids, nucleic acids, vitamins, saccharides, organic acids, 
and the like, by repeating mutagenesis and breeding based on random mutagenesis using mutagens, such as NTG 
or the like, and screening. 

[0142] In recent years, many examples of improved production strains have been made through the use of recom- 
binant DNA techniques. In breeding, however, most of the parent production strains to be improved are mutants ob- 
tained by a conventional mutagenic procedure (W. Leuchtenberger, Amino Acids - Technical Production and Use. In: 
Roehr (ed) Biotechnology, second edition, vol. 6, products of primary metabolism. VCH VeriagsgesellschaftmbH, Wein- 
heim. P 465 (1996)). 

[0143] Although mutagenesis methods have largely contributed to the progress of the fermentation industry, they 
suffer from a serious problem of multiple, random introduction of mutations into every part of the chromosome. Since 
many mutations are accumulated in a single chromosome each time a strain is improved, a production strain obtained 
by the random mutation and selecting is generally inferior in properties (for example, showing poor growth, delayed 
consumption of saccharides, and poor resistance to stresses such as temperature and oxygen) to a wild type strain, 
which brings about troubles such as failing to establish a sufficiently elevated productivity, being frequently contami- 
nated with miscellaneous bacteria, requiring troublesome procedures in culture maintenance, and the like, and, in its 
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turn, elevating the production cost in practice. In addition, the improvement in the productivity is based on random 
mutations and thus the mechanism thereof is unclear. Therefore, it is very difficult to plan a rational breeding strategy 
for the subsequent improvement in the productivity. 

[0144] According to the present invention, effective mutation points contributing to the production can be efficiently 
5 specified from among many mutation points accumulated in the chromosome of a production strain which has been 
bred from coryneform bacteria and, therefore, a novel breeding method of assembling these effective mutations in the 
coryneform bacteria can be established. Thus, a useful production strain can be reconstructed. It is also possible to 
construct a useful production strain from a wild type strain. 
[0145] Specifically, a useful mutant can be constructed in the following manner. 
10 [0146] One of the mutation points is incorporated into a wild type strain of coryneform bacteria. Then, it is examined 
whether or not a positive effect is established on the production. When a positive effect is obtained, the mutation point 
is saved. When no effect is obtained, the mutation point is removed. Subsequently, only a strain having the effective 
mutation point is used as the parent strain, and the same procedure is repeated. In general, the effectiveness of a 
mutation positioned upstream cannot be clearly evaluated in some cases when there is a rate-determining point in the 
15 downstream of a biosynthesis pathway. It is therefore preferred to successively evaluate mutation points upward from 
downstream. 

[0147] By reconstituting effective mutations by the method as described above in a wild type strain or a strain which 
has a high growth speed or the same ability to consume saccharides as the wild type strain, it is possible to construct 
an industrially advantageous strain which is free of troubles in the previous methods as described above and to conduct 

20 fermentation production using such strains within a short time or at a higher temperature. 

[0148] For example, a lysine-producing mutant B-6 (Appl. Microbiol. BiotechnoL, 32: 262-273 (1989)), which is ob- 
tained by multiple rounds of random mutagenesis from a wild type strain Corynebacterium gtutamicum ATCC 13032, 
enables lysine fermentation to be performed at a temperature between 30 and 34°C but shows lowered growth and 
lysine productivity at a temperature exceeding 34°C. Therefore, the fermentation temperature should be maintained 

25 at 34°C or lower. In contrast thereto, the production strain described in the above item 5, which is obtained by recon- 
stituting effective mutations relating to lysine production, can achieve a productivity at 40 to 42°C equal or superior to 
the result obtained by culturing at 30 to 34°C. Therefore, this strain is industrially advantageous since it can save the 
load of cooling during the fermentation. 

[0149] When culture should be carried out at a high temperature exceeding 43°C, a production strain capable of 
30 conducting fermentation production at a high temperature exceeding 43°C can be obtained by reconstituting useful 
mutations in a microorganism belonging to the genus Corynebacterium which can grow at high temperature exceeding 
43°C. Examples of the microorganism capable of growing at a high temperature exceeding 43°C include Corynebac- 
terium thermoaminogenes, such as Corynebacterium thermoaminogenes FERM 9244, FERM 9245, FERM 9246 and 
FERM 9247. 

35 [0150] A strain having a further improved productivity of the target product can be obtained using the thus recon- 
structed strain as the parent strain and further breeding it using the conventional mutagenesis method, the gene am- 
plification method, the gene replacement method using the recombinant DNA technique, the transduction method or 
the cell fusion method. Accordingly, the microorganism of the present invention includes, but is not limited to, a mutant, 
a cell fusion strain, a transformant, a transductant or a recombinant strain constructed by using recombinant DNA 

40 techniques, so long as it is a producing strain obtained via the step of accumulating at least two effective mutations in 
a coryneform bacteria in the course of breeding. 

[0151] When a mutation point judged as being harmful to the growth or production is specified, on the other hand, 
it is examined whether or not the producing strain used at present contains the mutation point. When it has the mutation , 
it can be returned to the wild type gene and thus a further useful production strain can be bred. 
45 [0152] The breeding method as described above is applicable to microorganisms, other than coryneform bacteria, 
which have industrially advantageous properties (for example, microorganisms capable of quickly utilizing less expen- 
sive carbon sources, microorganisms capable of growing at higher temperatures). 

7. Production and utilization of polynucleotide array 

50 

(1) Production of polynucleotide array 

[0153] A polynucleotide array can be produced using the polynucleotide or oligonucleotide of the present invention 
obtained in the above items 1 and 2. 
55 [0154] Examples include a polynucleotide array comprising a solid support to which at least one of a polynucleotide 
comprising the nucleotide sequence represented by SEQ ID NOS:2 to 3501 , a polynucleotide which hybridizes with 
the polynucleotide under stringent conditions, and a polynucleotide comprising 10 to 200 continuous nucleotides in 
the nucleotide sequence of the polynucleotide is adhered; and a polynucleotide array comprising a solid support to 
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which at least one of a polynucleotide encoding a polypeptide comprising the amino acid sequence represented by 
any one of SEQ ID NOS:3502 to 7001, a polynucleotide which hybridizes with the polynucleotide under stringent 
conditions, and a polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequences of the polynu- 
cleotides is adhered. 

[0155] Polynucleotide arrays of the present invention include substrates known in the art, such as a DNA chip, a 
DNA microarray and a DNA macroarray, and the like, and comprises a solid support and plural polynucleotides or 
fragments thereof which are adhered to the surface of the solid support. 
[0156] Examples of the solid support include a glass plate, a nylon membrane, and the like. 

[0157] The polynucleotides or fragments thereof adhered to the surface of the solid support can be adhered to the 
surface of the solid support using the general technique for preparing arrays. Namely, a method in which they are 
adhered to a chemically surface-treated solid support, for example, to which a polycation such as polylysine or the like 
has been adhered {Nat. Genet. ,21: 15-19 (1999)). The chemically surface-treated supports are commercially available 
and the commercially available solid product can be used as the solid support of the polynucleotide array according 
to the present invention. 

[0158] As the polynucleotides or oligonucleotides adhered to the solid support, the polynucleotides and oligonucle- 
otides of the present Invention obtained in the above items 1 and 2 can be used. 

[01 59] The analysis described below can be efficiently performed by adhering the polynucleotides or oligonucleotides 
to the solid support at a high density, though a high fixation density is not always necessary. 

[0160] Apparatus for achieving a high fixation density, such as an arrayer robot or the like, is commercially available 
from Takara Shuzo (GMS417 Arrayer), and the commercially available product can be used, 

[0161] Also, the oligonucleotides of the present invention can be synthesized directly on the solid support by the 
photolithography method or the like (Nat. Genet, 21: 20-24 (1 999)). In this method, a linker having a protective group 
which can be removed by light irradiation is first adhered to a solid support, such as a slide glass or the like. Then, it 
is irradiated with light through a mask (a photolithograph mask) permeating light exclusively at a definite part of the 
adhesion part. Next, an oligonucleotide having a protective group which can be removed by light irradiation is added 
to the part. Thus, a ligation reaction with the nucleotide arises exclusively at the irradiated part. By repeating this 
procedure, oligonucleotides, each having a desired sequence, different from each other can be synthesized in respec- 
tive parts. Usually, the oligonucleotides to be synthesized have a length of 10 to 30 nucleotides. 

30 (2) Use of polynucleotide array 

[0162] The following procedures (a) and (b) can be carried out using the polynucleotide array prepared in the above 
d). 

35 (a) Identification of mutation point of coryneform bacterium mutant and analysis of expression amount and expression 
profile of gene encoded by genome 

[0163] By subjecting a gene derived from a mutant of coryneform bacteria or an examined gene to the following 
steps (i) to (iv), the mutation point of the gene can be identified or the expression amount and expression profile of the 
40 gene can be analyzed: 

(i) producing a polynucleotide array by the method of the above (1); 

(ii) incubating polynucleotides immobilized on the polynucleotide array together with the labeled gene derived from 
a mutant of the coryneform bacterium using the polynucleotide array produced in the above (i) under hybridization 

45 conditions; 

(iii) detecting the hybridization; and 

(iv) analyzing the hybridization data. 

[0164] The gene derived from a mutant of coryneform bacteria or the examined gene include a gene relating to 
50 biosynthesis of at least one selected from amino acids, nucleic acids, vitamins, saccharides, organic acids, and ana- 
logues thereof. 

[0165] The method will be described in detail. 

[0166] A single nucleotide polymorphism (SNP) in a human region of 2,300 kb has been identified using polynucle- 
otide arrays (Science, 280. 1 077-82 (1 998)). In accordance with the method of identifying SNP and methods described 
55 in Science, 278: 680-686 (1997); Proc. Natl. Acad. Sci. USA, 96: 12833-38 (1999); Science, 284: 1520-23 (1999) : and 
the like using the polynucleotide array produced in the above (1 ) and a nucleic acid molecule (DNA, RNA) derived from 
coryneform bacteria in the method of the hybridization, a mutation point of a useful mutant, which is useful in producing 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, or the like can be identified and the gene 
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expression amount and the expression profile thereof can be analyzed. 

[0167] The nucleic acid molecule (DNA, RNA) derived from the coryneform bacteria can be obtained according to 
the general method described in Molecular Cloning, 2nd ed. or the like. mRNA derived from Corynebacterium glutami- 
cum can also be obtained by the method of Bormann et al. (Molecular Microbiology, 6: 31 7-326 (1 992)) or the like. 
5 [0168] Although ribosomal RNA (rRNA) is usually obtained in large excess in addition to the target mRNA, the anal- 
ysis is not seriously disturbed thereby. 

[0169] The resulting nucleic acid molecule derived from coryneform bacteria is labeled. Labeling can be carried out 
according to a method using a fluorescent dye, a method using a radioisotope or the like. 

[0170] Specific examples include a labeling method in which psoralen-biotin is crosslinked with RNA extracted from 
10 a microorganism and, after hybridization reaction, a fluorescent dye having streptoavidin bound thereto is bound to 
the biotin moiety (Nat. BiotechnoL, 16: 45-48 (1998)); a labeling method in which a reverse transcription reaction is 
carried out using RNA extracted from a microorganism as a template and random primers as primers, and dUTP having 
a fluorescent dye (for example, Cy3, Cy5) (manufactured by Amersham Pharmacia Biotech) is incorporated into cDNA 
(Proc. Natl. Acad Sci. USA, 96: 12833-38 (1999)); and the like. 
15 [0171] The labeling specificity can be improved by replacing the random primers by sequences complementary to 
the 3-end of ORF (J. Bacteriol., 181: 6425-40 (1999)). 

[0172] In the hybridization method, the hybridization and subsequent washing can be carried out by the general 
method (Nat. BioctechnoL, 14: 1675-80 (1996), or the like). 

[0173] Subsequently, the hybridization intensity is measured depending on the hybridization amount of the nucleic 
20 acid molecule used in the labeling. Thus, the mutation point can be identified and the expression amount of the gene 
can be calculated. 

[0174] The hybridization intensity can be measured by visualizing the fluorescent signal, radioactivity, luminescence 
dose, and the like, using a laser confocal microscope, a CCD camera, a radiation imaging device (for example, STORM 
manufactured by Amersham Pharmacia Biotech), and the like, and then quantifying the thus visualized data. 
25 [0175] A polynucleotide array on a solid support can also be analyzed and quantified using a commercially available 
apparatus, such as GMS418 Array Scanner (manufactured by Takara Shuzo) or the like. 

[01 76] The gene expression amount can be analyzed using a commercially available software (for example, ImaGene 
manufactured by Takara Shuzo; Array Gauge manufactured by Fuji Photo Film; ImageQuant manufactured by Amer- 
sham Pharmacia Biotech, or the like). 
30 [0177] A fluctuation in the expression amount of a specific gene can be monitored using a nucleic acid molecule 
obtained in the time course of culture as the nucleic acid molecule derived from coryneform bacteria. The culture 
conditions can be optimized by analyzing the fluctuation. 

[0178] The expression profile of the microorganism at the total gene level (namely, which genes among a great 
number of genes encoded by the genome have been expressed and the expression ratio thereof) can be determined 
35 using a nucleic acid molecule having the sequences of many genes determined from the full genome sequence of the 
microorganism. Thus, the expression amount of the genes determined by the full genome sequence can be analyzed 
and, in its turn, the biological conditions of the microorganism can be recognized as the expression pattern at the full 
gene level. 

40 (b) Confirmation of the presence of gene homologous to examined gene in coryneform bacteria 

[0179] Whether or not a gene homologous to the examined gene, which is present in an organism other than co- 
ryneform bacteria, is present in coryneform bacteria can be detected using the polynucleotide array prepared in the 
above (1). 

45 [0180] This detection can be carried out by a method in which an examined gene which is present in an organism 
other than coryneform bacteria is used instead of the nucleic acid molecule derived from coryneform bacteria used in 
the above identification/analysis method of (1). 

8. Recording medium storing full genome nucleotide sequence and ORF data and being readable by a computer and 
50 methods for using the same 

[0181] The term "recording medium or storage device which is readable by a computer" means a recording medium 
or storage medium wh ich can be directly readout and accessed with a computer. Examples include magnetic recording 
media, such as a floppy disk, a hard disk, a magnetic tape, and the like; optical recording media, such as CD-ROM, 
55 CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, and the like; electric recording media, such as RAM, ROM, and the 
like; and hybrids in these categories (for example, magnetic/optical recording media, such as MO and the like). 
[01 82] Instruments for recording or inputting in or on the recording medium or instruments or devices for reading out 
the information in the recording medium can be appropriately selected, depending on the type of the recording medium 
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and the access device utilized. Also, various data processing programs, software, comparator and formats are used 
for recording and utilizing the polynucleotide sequence information or the like, of the present invention in the recording 
medium. The information can be expressed in the form of a binary file, a text file or an ASCII file formatted with com- 
mercially available software, for example. Moreover, software for accessing the sequence information is available and 

5 known to one of ordinary skill in the art. 

[01 83] Examples of the information to be recorded in the above-described medium include the full genome nucleotide 
sequence information of coryneform bacteria as obtained in the above item 2, the nucleotide sequence information of 
ORF, the amino acid sequence information encoded by the ORF, and the functional information of polynucleotides 
coding for the amino acid sequences. 

io [0184] The recording medium or storage device which is readable by a computer according to the present invention 
refers to a medium in which the information of the present invention has been recorded. Examples include recording 
media or storage devices which are readable by a computer storing the nucleotide sequence information represented 
by SEQ ID NOS:1 to 3501, the amino acid sequence information represented by SEQ ID NOS:3502 to 7001, the 
functional information of the nucleotide sequences represented by SEQ ID NOS:1 to 3501 , the functional information 

is of the amino acid sequences represented by SEQ ID NOS:3502 to 7001 , and the information listed in Table 1 below 
and the like. 



9. System based on a computer using the recording medium of the present invention which is readable by a computer 



20 [0185] The term "system based on a computer" as used herein refers a system composed of hardware device(s), 
software device(s), and data recording device(s) which are used for analyzing the data recorded in the recording me- 
dium of the present invention which is readable by a computer. 

[01 86] The hardware device(s) are, for example, composed of an input unit, a data recording unit, a central processing 
unit and an output unit collectively or individually. 

25 [0187] By the software device(s), the data recorded in the recording medium of the present invention are searched 
or analyzed using the recorded data and the hardware device(s) as described herein. Specifically, the software device 
(s) contain at least one program which acts on or with the system in order to screen, analyze or compare biologically 
meaningful structures or information from the nucleotide sequences, amino acid sequences and the like recorded in 
the recording medium according to the present invention. 

30 [0188] Examples of the software device(s) for identifying ORF and EMF domains include GeneMark (Nuc. Acids. 
Res., 22. 4756-67 (1994)), GeneHacker (Protein, Nucleic Acid and Enzyme, 42\ 3001-07 (1997)), Glimmer (The Insti- 
tute of Genomic Research; Nuc. Acids. Res., 26. 544-548 (1 998)) and the like. In the process of using such a software 
device, the default (initial setting) parameters are usually used, although the parameters can be changed, if necessary, 
in a manner known to one of ordinary skill in the art. 

35 [0189] Examples of the software device(s) for identifying a genome domain or a polypeptide domain analogous to 
the target sequence or the target structural motif (homology searching) include FASTA, BLAST, Smith -Waterman, 
GenetyxMac (manufactured by Software Development), GCG Package (manufactured by Genetic Computer Group), 
GenCore (manufactured by Compugen), and the like. In the process of using such a software device, the default (initial 
setting) parameters are usually used, although the parameters can be changed, if necessary in a manner known to 

40 one of ordinary skill in the art. 

[0190] Such a recording medium storing the full genome sequence data is useful in preparing a polynucleotide array 
by which the expression amount of a gene encoded by the genome DNA of coryneform bacteria and the expression 
profile at the total gene level of the microorganism, namely, which genes among many genes encoded by the genome 
have been expressed and the expression ratio thereof, can be determined. 

^5 [0191] The data recording device(s) provided by the present invention are, for example, memory device(s) for re- 
cording the data recorded in the recording medium of the present invention and target sequence or target structural 
motif data, or the like, and a memory accessing device(s) for accessing the same. 

[0192] Namely, the system based on a computer according to the present invention comprises the following: 

50 (i) a user input device that inputs the information stored in the recording medium of the present invention, and 

target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the information stored in the recording medium of the present invention with the 
target sequence or target structure motif information, recorded by the data storing device of (ii) for screening and 

55 analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target 

structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 
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[0193] This system is usable in the methods in items 2 to 5 as described above for searching and analyzing theORF 
and EMF domains, target sequence, target structural motif, etc. of a coryneform bacterium, searching homologs, 
searching and analyzing isozymes, determining the biosynthesis pathway and the signal transmission pathway, and 
identifying spots which have been found in the proteome analysis. The term "homologs" as used herein includes both 
5 of orthologs and paralogs. 

10. Production of polypeptide using ORF derived from coryneform bacteria 

[01 94] The polypeptide of the present invention can be produced using a polynucleotide comprising the ORF obtained 
10 in the above item 2. Specifically, the polypeptide of the present invention can be produced by expressing the polynu- 
cleotide of the present invention or a fragment thereof in a host cell, using the method described in Molecular Cloning, 
2nd ed., Current Protocols in Molecular Biology, and the like, for example, according to the following method. 
[0195] A DNA fragment having a suitable length containing a part encoding the polypeptide is prepared from the full 
length ORF sequence, if necessary. 
15 [0196] Also, DNA in which nucleotides in a nucleotide sequence at a part encoding the polypeptide of the present 
invention are replaced to give a codon suitable for expression of the host cell, if necessary. The DNA is useful for 
efficiently producing the polypeptide of the present invention. 

[0197] A recombinant vector is prepared by inserting the DNA fragment into the downstream of a promoter in a 
suitable expression vector. 
20 [0198] The recombinant vector is introduced to a host cell suitable for the expression vector. 

[0199] Any of bacteria, yeasts, animal cells, insect cells, plant cells, and the like can be used as the host cell so long 
as it can be expressed in the gene of interest. 

[0200] Examples of the expression vector include those which can replicate autonomously in the above-described 
host cell or can be integrated into chromosome and have a promoter at such a position that the DNA encoding the 

25 polypeptide of the present invention can be transcribed. 

[0201] When a procaryote cell, such as a bacterium or the like, is used as the host cell, it is preferred that the 
recombinant vector containing the DNA encoding the polypeptide of the present invention can replicate autonomously 
in the bacterium and is a recombinant vector constituted by, at least a promoter, a ribosome binding sequence, the 
DNA of the present invention and a transcription termination sequence. A promoter controlling gene can also be con- 

30 tained therewith in operable combination. 

[0202] Examples of the expression vectors include a vector plasmid which is replicable in Cory nebacteri urn glutami- 
cum, such as pCGI (Japanese Published Unexamined Patent Application No. 134500/82), pCG2 (Japanese Published 
Unexamined Patent Application No. 35197/83), pCG4 (Japanese Published Unexamined Patent Application No. 
183799/82), pCG11 (Japanese Published Unexamined Patent Application No. 134500/82), pCG116, pCE54 and 

35 pCB101 (Japanese Published Unexamined Patent Application No. 1 05999/83), pCE51 , pCE52 and pCE53 (Mol. Gen. 
Genet, 196: 175-178 (1984)) : and the like; a vector plasmid which is replicable in Escherichia coti, such as pET3 and 
pET11 (manufactured by Stratagene), pBAD, pThioHis and pTrcHis (manufactured by Invitrogen), pKK223-3 and 
pGEX2T (manufactured by Amersham Pharmacia Biotech), and the like; and pBTrp2, pBTad and pBTac2 (manufac- 
tured by Boehringer Mannheim Co.), pSE280 (manufactured by Invitrogen), pGEMEX-1 (manufactured by Promega), 

40 pQE-8 (manufactured by QIAGEN) : pKYPIO (Japanese Published Unexamined Patent Application No. 110600/83), 
pKYP200 (Agric. Biol. Chem., 48: 669 (1984)), pLSA1 (Agric. Biol. Chem., 53: 277 (1989)), pGEL1 (Proc. Natl. Acad. 
Sci. USA, 82: 4306 (1985)), pBluescript II SK(-) (manufactured by Stratagene), pTrs30 (prepared from Escherichia coli 
JM109/pTrS30 (FERM BP-5407)), pTrs32 (prepared from Escherichia co// JM109/pTrS32 (FERM BP-5408)), pGHA2 
(prepared from Escherichia coli IGHA2 (FERM B-400), Japanese Published Unexamined Patent Application No. 

45 221091/85), pGKA2 (prepared from Escherichia coti I G K A2 (FERM BP-6798), Japanese Published Unexamined Patent 
Application No. 221091/85), pTerm2 (U.S. Patents 4,686,191, 4,939,094 and 5,160,735), pSupex, pUB110, pTP5, 
pC194 andpEG400 (J. Bactehol., 172: 2392 (1 990)), pGEX (manufactured by Pharmacia), pET system (manufactured 
by Novagen), and the like. 

[0203] Any promoter can be used so long as it can function in the host cell. Examples include promoters derived 
50 from Escherichia coli, phage and the like, such as trp promoter (P trp ) , lac promoter, P L promoter, P R promoter, T7 

promoter and the like. Also, artificially designed and modified promoters, such as a promoter in which two Ptrp are 

linked in series (P +rp x2) , tac promoter, /acT7 promoter let\ promoter and the like, can be used. 

[0204] It is preferred to use a plasmid in which the space between Shine-Dalgamo sequence which is the ribosome 

binding sequence and the initiation codon is adjusted to an appropriate distance (for example, 6 to 18 nucleotides). 
55 [0205] The transcription termination sequence is not always necessary for the expression of the DNA of the present 

invention. However, it is preferred to arrange the transcription terminating sequence at just downstream of the structural 

gene. 

[0206] One of ordinary skill in the art will appreciate that the codons of the above-described elements may be opti- 
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mi7ed, in a known manner, depending on the host cells and environmental conditions utilized. 
[0207] Examples of the host cell include microorganisms belonging to the genus Escherichia, the genus Serratia, 
the genus Bacillus, the genus Brevibacterium, the genus Corynebacterium, the genus Microbacterium, the genus Pseu- 
domonas, and the like. Specific examples include Escherichia coli XL1-Blue, Escherichia co// XL2-Blue, Escherichia 
5 coli DH1 , Escherichia co// MC1000. Escherichia coli KY3276, Escherichia coli r W1485, Escherichia coli ; JM109, Es- 
cner/c/i/aco//HB101, Escherichia coli No. 49, Escherichia coli W31 1 0, Escherichia coli NY 49, Escherichia coli G\698, 
Escherichia coli TB1, Serratia ficaria, Serratia fonticola, Serratia liquefaciens, Serratia marcescens, Bacillus subtilis, 
Bacillus amyloliquefaciens, Corynebacterium ammonia genes, Brevibacterium immahophilum ATCC 14068, Brevibac- 
terium saccharolyticum ATCC 14066, Corynebacterium gtutamicum ATCC 13032, Corynebacterium glutamicum ATCC 
'0 1 3869, Corynebacterium gtutamicum ATCC 1 4067 (prior genus and species: Brevibacterium flavum), Corynebacterium 
gtutamicum ATCC 1 3869 (prior genus and species: Brevibacterium lactolermentum, or Corynebacterium lactofermen- 
tum), Corynebacterium acetoacidoph'tium ATCC 13870, Corynebacterium thermoaminogenes FERM 9244 : Microbac- 
terium ammoniaphifum ATCC 15354, Pseudomonas putida, Pseudomonas sp. D-0110, and the like. 
[0208] When Corynebacterium glutamicum or an analogous microorganism is used as a host, an EMF necessary 

'5 for expressing the polypeptide is not always contained in the vector so long as the polynucleotide of the present in- 
vention contains an EMF. When the EMF is not contained in the polynucleotide, it is necessary to prepare the EMF 
separately and ligate it so as to be in operable combination. Also, when a higher expression amount or specific ex- 
pression regulation is necessary, it is necessary to ligate the EMF corresponding thereto so as to put the EMF in 
operable combination with the polynucleotide. Examples of using an externally ligated EMF are disclosed in Microbi- 

20 ology 142: 1297-1309(1996). 

[0209] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 
the above-described host cells, such as a method in which a calcium ion is used (Proc. Natl. Acad. Sci. USA, 69: 2110 
(1972)), a protoplast method (Japanese Published Unexamined Patent Application No. 2483942/88), the methods 
described in Gonc t 17: 107 (1982) and Mo/ocular & General Genetics, 168: 111 (1979) and the like, can be used. 

25 [0210] When yeast is used as the host cell, examples of the expression vector include pYES2 (manufactured by 
Invitrogen), YEp13 (ATCC 37115), YEp24 (ATCC 37051), YCp50 (ATCC 37419), pHS19, pHS15, and the like. 
[0211] Any promoter can be used so long as it can be expressed in yeast. Examples include a promoter of a gene 
in the glycolytic pathway such as hexose kinase and the like, PH05 promoter, PGK promoter, GAP promoter, ADH 
promoter, gai 1 promoter, gal 10 promoter, a heat shock protein promoter, MFal promoter, CUP 1 promoter, and the like. 

30 [0212] Examples of the host cell include microorganisms belonging to the genus Saccharomyces, the genus 
Schizosaccharomyces, the genus Kluyveromyces, the genus Trichosporon, the genus Schwanniomyces, the genus 
Pichia, the genus Candida and the like. Specific examples include Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Kluyveromyces lactis, Trichosporon pullulans, Schwanniomyces alluvius, Candida utilis and the like. 
[0213] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 

35 yeast, such as an electroporation method (Methods. EnzymoL, 194: 182 (1990)), a spheroplast method (Proc. Natl. 
Acad. Sci. USA, 75: 1929 (1978)), a lithium acetate method (J. Bacteriol., 153: 163 (1983)), a method described in 
Proc. Natl. Acad. Sci. USA, 75: 1929 (1978) and the like, can be used. 

[0214] When animal cells are used as the host cells, examples of the expression vector include pcDNA3.1 , pSinRep5 
and pCEP4 (manufactured by Invitorogen), pRev-Tre (manufactured by Clontech), pAxCAwt (manufactured by Takara 
*o Shuzo), pcDNAI and pcDM8 (manufactured by Funakoshi), pAGE107 (Japanese Published Unexamined Patent Ap- 
plication No. 22979/91 ; Cytotechnology, 21 33 (1 990)), pAS3-3 (Japanese Published Unexamined Patent Application 
No. 227075/90), pcDM8 (Nature, 329: 840 (1 987)), pcDNAI/Amp (manufactured by Invitrogen), pREP4 (manufactured 
by Invitrogen), pAGE103 (J. Biochem., 101: 1307 (1987)), pAGE210, and the like. 

[0215] Any promoter can be used so long as it can function in animal cells. Examples include a promoter of IE 
*5 (immediate early) gene of cytomegalovirus (CMV), an early promoter of SV40, a promoter of retrovirus, a metal- 
lothionein promoter, a heat shock promoter, SRa promoter, and the like. Also, the enhancer of the IE gene of human 
CMV can be used together with the promoter. 

[0216] Examples of the host cell include human Namalwa cell, monkey COS cell, Chinese hamster CHO cell, 
HST5637 (Japanese Published Unexamined Patent Application No. 299/88), and the like. 
50 [0217] The method for introduction of the recombinant vector into animal cells is not particularly limited, so long as 
it is the general method for introducing DNA into animal cells, such as an electroporation method (Cytotechnology 3: 
133 (1990)), a calcium phosphate method (Japanese Published Unexamined Patent Application No. 227075/90), a 
lipofection method (Proc. Natl. Acad. Sci. USA, 84, 7413 (1987)), the method described in Virology 52-. 456 (1973), 
and the like. 

>5 [0218] When insect cells are used as the host cells, the polypeptide can be expressed, for example, by the method 
described in Bacurovirus Expression Vectors, A Laboratory Manual, W.H. Freeman and Company, New York (1992), 
Bio/Technology, 6: 47 (1988), or the like. 

[0219] Specifically, a recombinant gene transfer vector and bacurovirus are simultaneously inserted into insect cells 
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to obtain a recombinant virus in an insect cell culture supernatant, and then the insect cells are infected with the resulting 
recombinant virus to express the polypeptide. 

[0220] Examples of the gene introducing vector used in the method include pBlueBac4.5, pVL1392, pVL1393 and 
pBlueBaclll (manufactured by Invitrogen), and the like. 
5 [0221] Examples of the bacurovirus include Autographa californica nuclear polyhedrosis virus with which insects of 
the family Barathra are infected, and the like. 

[0222] Examples of the insect cells include Spodoptera frugiperda oocytes Sf9 and Sf21 (Bacurovirus Expression 
Vectors, A Laboratory Manual, W.H. Freeman and Company, New York (1992)), Trichoplusia ni oocyte High 5 (manu- 
factured by Invitrogen) and the like. 
10 [0223] The method for simultaneously incorporating the above-described recombinant gene transfer vector and the 
above-described bacurovirus for the preparation of the recombinant virus include calcium phosphate method (Japanese 
Published Unexamined Patent Application No. 227075/90), lipofection method {Proc. Natl. Acad. Sci. USA, 84: 7413 
(1987)) and the like. 

[0224] When plant cells are used as the host cells, examples of expression vector include a Ti plasmid, a tobacco 
15 mosaic virus vector, and the like. 

[0225] Any promoter can be used so long as it can be expressed in plant cells. Examples include 35S promoter of 
cauliflower mosaic virus (CaMV), rice actin 1 promoter, and the like. 

[0226] Examples of the host cells include plant cells and the like, such as tobacco, potato, tomato, carrot, soybean, 
rape, alfalfa, rice, wheat, barley, and the like. 

20 [0227] The method for introducing the recombinant vector is not particularly limited, so long as it is the general method 
for introducing DNA into plant cells, such as the Agrobacterium method (Japanese Published Unexamined Patent 
Application No. 140885/84, Japanese Published Unexamined Patent Application No. 70080/85, WO 94/00977), the 
electroporation method (Japanese Published Unexamined Patent Application No. 251887/85), the particle gun method 
(Japanese Patents 2606856 and 2517813), and the like. 

25 [0228] The transformant of the present invention includes a transformant containing the polypeptide of the present 
invention per se rather than as a recombinant vector, that is, a transformant containing the polypeptide of the present 
invention which is integrated into a chromosome of the host, in addition to the transformant containing the above 
recombinant vector. 

[0229] When expressed in yeasts, animal cells, insect cells or plant cells, a glycopolypeptide or glycosylated polypep- 
30 tide can be obtained. 

[0230] The polypeptide can be produced by culturing the thus obtained transformant of the present invention in a 
culture medium to produce and accumulate the polypeptide of the present invention or any polypeptide expressed 
under the control of an EMF of the present invention, and recovering the polypeptide from the culture. 
[0231] Culturing of the transformant of the present invention in a culture medium is carried out according to the 
35 conventional method as used in culturing of the host. 

[0232] When the transformant of the present invention is obtained using a prokaryote, such as Escherichia coliox 
the like, or a eukaryote, such as yeast or the like, as the host, the transformant is cultured. 

[0233] Any of a natural medium and a synthetic medium can be used, so long as it contains a carbon source, a 
nitrogen source, an inorganic salt and the like which can be assimilated by the transformant and can perform culturing 
40 of the transformant efficiently. 

[0234] Examples of the carbon source include those which can be assimilated by the transformant, such as carbo- 
hydrates (for example, glucose, fructose, sucrose, molasses containing them, starch, starch hydrolysate, and the like), 
organic acids (for example, acetic acid, propionic acid, and the like), and alcohols (for example, ethanol, propanol, and 
the like). 

45 [0235] Examples of the nitrogen source include ammonia, various ammonium salts of inorganic acids or organic 
acids (for example, ammonium chloride, ammonium sulfate, ammonium acetate, ammonium phosphate, and the like), 
other nitrogen-containing compounds, peptone, meat extract, yeast extract, corn steep liquor, casein hydrolysate, soy- 
bean meal and soybean meal hydrolysate, various fermented cells and hydrolysates thereof, and the like. 
[0236] Examples of inorganic salt include potassium dihydrogen phosphate, dipotassium hydrogen phosphate, mag- 

50 nesium phosphate, magnesium sulfate, sodium chloride, ferrous sulfate, manganese sulfate, copper sulfate, calcium 
carbonate, and the like. 

[0237] The culturing is carried out under aerobic conditions by shaking culture, submerged-aeration stirring culture 
or the like. The culturing temperature is preferably from 15 to 40°C, and the culturing time is generally from 16 hours 
to 7 days. The pH of the medium is preferably maintained at 3.0 to 9.0 during the culturing. The pH can be adjusted 
55 using an inorganic or organic acid, an alkali solution, urea, calcium carbonate, ammonia, or the like. 

[0238] Also, antibiotics, such as ampicillin, tetracycline, and the like, can be added to the medium during the culturing, 
if necessary. 

[0239] When a microorganism transformed with a recombinant vector containing an inducible promoter is cultured, 
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an inducer can be added to the medium, if necessary. 

[0240] For example, isopropyl-p-D-thiogalactopyranoside (IPTG) or the like can be added to the medium when a 
microorganism transformed with a recombinant vector containing lac promoter is cultured, or indoleacrylic acid (IAA) 
or the like can by added thereto when a microorganism transformed with an expression vector containing trp promoter 
5 is cultured. 

[0241 ] Examples of the medium used in culturing a transf ormant obtained using animal cells as the host cells include 
RPMI 1640 medium (The Journal of the American Medical Association, 199. 519 (1967)), Eagle's MEM medium (Sci- 
ence, 12Z. 501 (1952)), Dulbecco's modified MEM medium (Virology, 8, 396 (1959)), 199 Medium (Proceeding of the 
Society for the Biological Medicine, 73: 1 ( 1 950)), the above-described media to which fetal caff serum has been added, 
10 and the like. 

[0242] The culturing is carried out generally at a pH of 6 to 8 and a temperature of 30 to 40°C in the presence of 5% 
C0 2 for 1 to 7 days. 

[0243] Also, if necessary, antibiotics, such as kanamycin, penicillin, and the like, can be added to the medium during 
the culturing. 

15 [0244] Examples of the medium used in culturing a transformant obtained using insect cells as the host cells include 
TNM-FH medium (manufactured by Pharmingen), Sf-900 II SFM (manufactured by Life Technologies), ExCell 400 and 
ExCell 405 (manufactured by JRH Biosciences), Grace's Insect Medium (Nature, 195: 788 (1962)), and the like. 
[0245] The culturing is carried out generally at a pH of 6 to 7 and a temperature of 25 to 30°C for 1 to 5 days. 
[0246] Additionally, antibiotics, such as genlamicin and the like, can be added to the medium during the culturing, if 

so necessary. 

[0247] A transformant obtained by using a plant cell as the host cell can be used as the cell or after differentiating 
to a plant cell or organ . Examples of the medium used in the culturing of the transformant include Murashige and Skoog 
(MS) medium, White medium, media to which a plant hormone, such as auxin, cytokinine, or the like has been added, 
and the like. 

25 [0248] The culturing is carried out generally at a pH of 5 to 9 and a temperature of 20 to 40°C for 3 to 60 days. 

[0249] Also, antibiotics, such as kanamycin, hygromycin and the like, can be added to the medium during the cul- 
turing, if necessary. 

[0250] As described above, the polypeptide can be produced by culturing a transformant derived from a microor- 
ganism, animal cell or plant cell containing a recombinant vector to which a DNA encoding the polypeptide of the 
30 present invention has been inserted according to the general culturing method to produce and accumulate the polypep- 
tide, and recovering the polypeptide from the culture.- 

[0251] The process of gene expression may include secretion of the encoded protein production or fusion protein 
expression and the like in accordance with the methods described in Molecular Cloning, 2nd ed., in addition to direct 
expression. 

35 [0252] The method for producing the polypeptide of the present invention includes a method of intracellular expres- 
sion in a host cell, a method of extracellular secretion from a host cell, or a method of production on a host cell membrane 
outer envelope. The method can be selected by changing the host cell employed or the structure of the polypeptide 
produced. 

[0253] When the polypeptide of the present invention is produced in a host cell or on a host ceil membrane outer 
40 envelope, the polypeptide can be positively secreted extracellulariy according to, for example, the method of Paulson 
et ai (J. Biol Chem., 264: 17619 (1989)), the method of Lowe et at. (Proc. Natl. Acad. Sci. USA, 86: 8227 (1989); 
Genes Develop., 4: 1288 (1 990)), and/orthe methods described in Japanese Published Unexamined Patent Application 
No. 336963/93, WO 94/23021 , and the like. 

[0254] Specifically, the polypeptide of the present invention can be positively secreted extracellulariy by expressing 
o it in the form that a signal peptide has been added to the foreground of a polypeptide containing an active site of the 
polypeptide of the present invention according to the recombinant DNA technique. 

[0255] Furthermore, the amount produced can be increased using a gene amplification system, such as by use of 
a dihydrofolate reductase gene or the like according to the method described in Japanese Published Unexamined 
Patent Application No. 227075/90. 
50 [0256] Moreover, the polypeptide of the present invention can be produced by a transgenic animal individual (trans- 
genic nonhuman animal) or plant individual (transgenic plant). 

[0257] When the transformant is the animal individual or plant individual; the polypeptide of the present invention 
can be produced by breeding or cultivating it so as to produce and accumulate the polypeptide, and recovering the 
polypeptide from the animal individual or plant individual. 
>5 [0258] Examples of the method for producing the polypeptide of the present invention using the animal individual 
include a method for producing the polypeptide of the present invention in an animal developed by inserting a gene 
according to methods known to those of ordinary skill in the art (American Journal of Clinical Nutrition, 63: 639S (1 996), 
American Journal of Clinical Nutrition, 63: 627S (1 996), Bio/Technology, 9: 830 (1991 )). 
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[0259] In the animal individual, the polypeptide can be produced by breeding a transgenic nonhuman animal to which 
the DNA encoding the polypeptide of the present invention has been inserted to produce and accumulate the polypep- 
tide in the animal, and recovering the polypeptide from the animal. Examples of the production and accumulation place 
in the animal include milk (Japanese Published Unexamined Patent Application No. 309192/88). egg and the like of 
5 the animal. Any promoter can be used, so long as it can be expressed in the animal. Suitable examples include an a- 
casein promoter, a (p-casein promoter, a p-lactoglobulin promoter, a whey acidic protein promoter, and the like, which 
are specific for mammary glandular cells. 

[0260] Examples of the method for producing the polypeptide of the present invention using the plant individual 
include a method for producing the polypeptide of the present invention by cultivating a transgenic plant to which the 
10 DNA encoding the protein of the present invention by a known method (Tissue Culture, 20 (1 994), Tissue Culture, 21 
(1 994), Trends in Biotechnology, 15:45 (1 997)) to produce and accumulate the polypeptide in the plant, and recovering 
the polypeptide from the plant. 

[0261] The polypeptide according to the present invention can also be obtained by translation in vitro. 

[0262] The polypeptide of the present invention can be produced by a translation system in vitro. There are, for 

is example, two in vitro translation methods which may be used, namely, a method using RNA as a template and another 
method using DNA as a template. The template RNA includes the whole RNA, mRNA, an in vitro transcription product, 
and the like. The template DNA includes a plasmid containing a transcriptional promoter and a target gene integrated 
therein and downstream of the initiation site, a PCR/RT-PCR product and the like. To select the most suitable system 
for the in vitro translation, the origin of the gene encoding the protein to be synthesized (prokaryotic cell/eucaryotic 

20 cell), the type of the template (DNA/RNA), the purpose of using the synthesized protein and the like should be consid- 
ered. In vitro translation kits having various characteristics are commercially available from many companies (Boe- 
hringer Mannheim , Promega, Stratagene, or the like), and every kit can be used in producing the polypeptide according 
to the present invention. 

[0263] Transcription/translation of a DNA nucleotide sequence cloned into a plasmid containing a T7 promoter can 
25 be carried out using an in vitro transcription/translation system £ co//T7 S30 Extract System for Circular DNA (man- 
ufactured by Promega, catalogue No. L1130). Also, transcription/translation using, as a template, a linear prokaryotic 
DNA of a supercoil non-sensitive promoter, such as /adJV5, tac, XPL(con), XPL, or the like, can be carried out using 
an in vitro transcription/translation system E. coliS30 Extract System for Linear Templates (manufactured by Promega, 
catalogue No. L1030). Examples of the linear prokaryotic DNA used as a template include a DNA fragment, a PCR- 
30 amplified DNA product, a duplicated oligonucleotide ligation, an in vitro transcriptional RNA, a prokaryotic RNA, and 
the like. 

[0264] In addition to the production of the polypeptide according to the present invention, synthesis of a radioactive 
labeled protein, confirmation of the expression capability of a cloned gene, analysis of the function of transcriptional 
reaction or translation reaction, and the like can be carried out using this system. 

35 [0265] The polypeptide produced by the transformant of the present invention can be isolated and purified using the 
general method for isolating and purifying an enzyme. For example, when the polypeptide of the present invention is 
expressed as a soluble product in the host cells, the cells are collected by centrifugation after cultivation, suspended 
in an aqueous buffer, and disrupted using an ultrasonicator, a French press, a Manton Gaulin homogenizer, a Dynomill, 
or the like to obtain a cell-free extract. From the supernatant obtained by centrifuging the cell-free extract, a purified 

40 product can be obtained by the general method used for isolating and purifying an enzyme, for example, solvent ex- 
traction, salting out using ammonium sulfate or the like, desalting, precipitation using an organic solvent, anion ex- 
change chromatography using a resin, such as diethylaminoethyl (DEAE)-Sepharose ; DIAION HPA-75 (manufactured 
by Mitsubishi Chemical) or the like, cation exchange chromatography using a resin, such as S-Sepharose FF (manu- 
factured by Pharmacia) or the like, hydrophobic chromatography using a resin, such as butyl sepharose, phenyl sepha- 

45 rose or the like, gel filtration using a molecular sieve, affinity chromatography, chromatofocusing, or electrophoresis, 
such as isoelectronic focusing or the like, alone or in combination thereof. 

[0266] When the polypeptide is expressed as an insoluble product in the host cells, the cells are collected in the 
same manner, disrupted and centrifuged to recover the insoluble product of the polypeptide as the precipitate fraction. 
Next, the insoluble product of the polypeptide is solubilized with a protein denaturing agent. The solubilized solution 
50 is diluted or dialyzed to lower the concentration of the protein denaturing agent in the solution. Thus, the normal con- 
figuration of the polypeptide is reconstituted. After the procedure, a purified product of the polypeptide can be obtained 
by a purification/isolation method similar to the above. 

[0267] When the polypeptide of the present invention or its derivative (for example, a polypeptide formed by adding 
a sugar chain thereto) is secreted out of cells, the polypeptide or its derivative can be collected in the culture supernatant. 
55 Namely, the culture supernatant is obtained by treating the culture medium in a treatment similar to the above (for 
example, centrifugation). Then, a purified product can be obtained from the culture medium using a purification/isolation 
method similar to the above. 

[0268] The polypeptide obtained by the above method is within the scope of the polypeptide of the present invention, 
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and examples include a polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from 
SEQ ID NOS:2 to 3431, and a polypeptide comprising an amino acid sequence represented by any one of SEQ ID 
NOS:3502 to 6931 . 

[0269] Furthermore, a polypeptide comprising an amino acid sequence in which at least one amino acids is deleted, 

5 replaced, inserted or added in the amino acid sequence of the polypeptide and having substantially the same activity 
as that of the polypeptide is included in the scope of the present invention. The term "substantially the same activity 
as that of the polypeptide" means the same activity represented by the inherent function, enzyme activity or the like 
possessed by the polypeptide which has not been deleted, replaced, inserted or added. The polypeptide can be ob- 
tained using a method for introducing part-specific mutation(s) described in, for example, Molecular Cloning, 2nd ed., 

10 Current Protocols in Molecular Biology, Nuc. Acids. Res., 10. 6487 (1982) : Proc. Natl. Acad. Sci. USA, 7£ 6409 (1982), 
Gene, 34: 315 (1985), Nuc. Acids. Res., 13: 4431 (1985), Proc. Natl. Acad. Sci. USA, 82: 488 (1985) and the like. For 
example, the polypeptide can be obtained by introducing mutation(s) to DNA encoding a polypeptide having the amino 
acid sequence represented by any one of SEQ ID NOS:3502 to 6931 . The number of the amino acids which are deleted, 
replaced, inserted or added is not particularly limited; however, it is usually 1 to the order of tens, preferably 1 to 20, 

15 more preferably 1 to 10, and most preferably 1 to 5, amino acids. 

[0270] The at least one amino acid deletion, replacement, insertion or addition in the amino acid sequence of the 
polypeptide of the present invention is used herein to refer to that at least one amino acid is deleted, replaced, inserted 
or added to at one or plural positions in the amino acid sequence. The deletion, replacement, insertion or addition may 
be caused in the same amino acid sequence simultaneously. Also, Ihe amino acid residue replaced, inserted or added 

20 can be natural or non-natural. Examples of the natural amino acid residue include L-alanine, L-asparagine, L-asparatic 
acid, L-glutamine : L-glutamic acid, glycine, L-histidine, L-isoleucine, L-leucine, L-lysine, L-methionine, L-phenylalanine, 
L-proline, L-serine, L-threonine, L-tryptophan, L-tyrosine, L-valine, L-cysteine, and the like. 

[0271] Herein, examples of amino acid residues which are replaced with each other are shown below. The amino 
acid residues in the same group can be replaced with each other. 

25 

Group A: 

[0272] leucine, isoleucine, norleucine, valine, norvaline, alanine, 2-aminobutanoic acid, methionine, O-methylserine, 
t-butylglycine, t-butylalanine, cyclohexylalanine; 

30 

Group B: 

[0273] asparatic acid, glutamic acid, isoasparatic acid, isoglutamic acid, 2-aminoadipic acid, 2-aminosuberic acid; 
35 Group C: 

[0274] asparagine, glutamine; 
Group D: 

40 

[0275] lysine, arginine, ornithine, 2,4-diaminobutanoic acid, 2,3-diaminopropionic acid; 
Group E: 

45 [0276] proline, 3-hydroxyproline, 4-hydroxyproline; 
Group F: 

[0277] serine, threonine, homoserine; 

50 

Group G: 

[0278] phenylalanine, tyrosine. 

[0279] Also, in order that the resulting mutant polypeptide has substantially the same activity as that of the polypeptide 
55 which has not been mutated, it is preferred that the mutant polypeptide has a homology of 60% or more, preferably 
80% or more, and particularly preferably 95% or more, with the polypeptide which has not been mutated, when calcu- 
lated, for example, using default (initial setting) parameters by a homology searching software, such as BLAST, FASTA, 
or the like. 
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[0280] Also, the polypeptide of the present invention can be produced by a chemical synthesis method, such as 
Fmoc (fluorenylmethyloxycarbonyl) method, tBoc (t-butyloxycarbonyl) method, or the like. It can also be synthesized 
using a peptide synthesizer manufactured by Advanced ChemTech, Perkin-Elmer, Pharmacia, Protein Technology 
Instrument, Synthecell-Vega, PerSeptive, Shimadzu Corporation, or the like. 
5 [0281 ] The transf ormant of the present invention can be used for objects other than the production of the polypeptide 
of the present invention. 

[0282] Specifically, at least one component selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an 
organic acid, and analogues thereof can be produced by culturing the transformant containing the polynucleotide or 
recombinant vector of the present invention in a medium to produce and accumulate at least one component selected 
10 from amino acids, nucleic acids, vitamins, saccharides, organic acids, and analogues thereof, and recovering the same 
from the medium. 

[0283] The biosynthesis pathways, decomposition pathways and regulatory mechanisms of physiologically active 
substances such as amino acids, nucleic acids, vitamins, saccharides, organic acids and analogues thereof differfrom 
organism to organism. The productivity of such a physiologically active substance can be improved using these differ- 

15 ences, specifically by introducing a heterogeneous gene relating to the biosynthesis thereof. For example, the content 
of lysine, which is one of the essential amino acids, in a plant seed was improved by introducing a synthase gene 
derived from a bacterium (WO 93/1 91 90). Also, arginine is excessively produced in a culture by introducing an arginine 
synthase gene derived from Escherichia coii (Japanese Examined Patent Publication 23750/93). 
[0284] To produce such a physiologically active substance, the transformant according to the present invention can 

20 be cultured by the same method as employed in culturing the transformant for producing the polypeptide of the present 
invention as described above. Also, the physiologically active substance can be recovered from the culture medium 
in combination with, for example, the ion exchange resin method, the precipitation method and other known methods. 
[0285] Examples of methods known to one of ordinary skill in the art include electroporation, calcium transfection, 
the protoplast method, the method using a phage, and the like, when the host is a bacterium; and microinjection, 

25 calcium phosphate transfection, the positively charged lipid-mediated method and the method using a virus, and the 
like, when the host is a eukaryote (Molecular Cloning, 2nd ed.; Spector era/., Cells/a laboratory manual, Cold Spring 
Harbour Laboratory Press, 1998)). Examples of the host include prokaryotes, lower eukaryotes (for example, yeasts), 
higher eukaryotes (for example, mammals), and cells isolated therefrom. As the state of a recombinant polynucleotide 
fragment present in the host cells ; it can be integrated into the chromosome of the host. Alternatively, it can be integrated 

30 into a factor (for example, a plasmid) having an independent replication unit outside the chromosome. These trans- 
formants are usable in producing the polypeptides of the present invention encoded by the ORF of the genome of 
Corynebacterium glutamicum, the polynucleotides of the present invention and fragments thereof. Alternatively, they 
can be used in producing arbitrary polypeptides under the regulation by an EMF of the present invention. 

35 11. Preparation of antibody recognizing the polypeptide of the present invention 

[0286] An antibody which recognizes the polypeptide of the present invention, such as a polyclonal antibody, a mon- 
oclonal antibody, or the like, can be produced using, as an antigen, a purified product of the polypeptide of the present 
invention or a partial fragment polypeptide of the polypeptide or a peptide having a partial amino acid sequence of the 
40 polypeptide of the present invention. 

(1) Production of polyclonal antibody 

[0287] A polyclonal antibody can be produced using, as an antigen, a purified product of the polypeptide of the 
45 present invention, a partial fragment polypeptide of the polypeptide, or a peptide having a partial amino acid sequence 
of the polypeptide of the present invention, and immunizing an animal with the same. 

[0288] Examples of the animal to be immunized include rabbits, goats : rats, mice, hamsters, chickens and the like. 
[0289] A dosage of the antigen is preferably 50 to 1 00 |xg per animal. 

[0290] When the peptide is used as the antigen , it is preferably a peptide covalently bonded to a carrier protein , such 
so as keyhole limpet haemocyanin, bovine thyroglobulin, or the like. The peptide used as the antigen can be synthesized 
by a peptide synthesizer. 

[0291] The administration of the antigen is, for example, carried out 3 to 10 times at the intervals of 1 or 2 weeks 
after the first administration. On the 3rd to 7th day after each administration, a blood sample is collected from the 
venous plexus of the eyeground, and it is conf irmed that the serum reacts with the antigen by the enzyme immunoassay 
55 (Enzyme-linked Immunosorbent Assay (ELISA), Igaku Shoin (1 976) ; Antibodies - A Laboratory Manual, Cold Spring 
Harbor Laboratory (1988)) or the like. 

[0292] Serum is obtained from the immunized non-human mammal with a sufficient antibody titer against the antigen 
used for the immunization, and the serum is isolated and purified to obtain a polyclonal antibody. 
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[0293] Examples of the method for the isolation and purification include centrifugation, salting out by 40-50% satu- 
rated ammonium sulfate, caprylic acid precipitation {Antibodies, A Laboratory manual, Cold Spring Harbor Laboratory 
(1988)), or chromatography using a DEAE-Sepharose column., an anion exchange column., a protein A- orG-column, 
a gel filtration column, and the like, alone or in combination thereof, by methods known to those of ordinary skill in the art. 

(2) Production of monoclonal antibody 

(a) Preparation of antibody-producing cell 

10 [0294] A rat having a serum showing an enough antibody titer against a partial fragment polypeptide of the polypep- 
tide of the present invention used for immunization is used as a supply source of an antibody-producing cell. 
[0295] On the 3rd to 7th day after the antigen substance is finally administered the rat showing the antibody titer, the 
spleen is excised. 

[0296] The spleen is cut to pieces in MEM medium (manufactured by Nissui Pharmaceutical), loosened using a pair 
is of forceps, followed by centrifugation at 1 ,200 rpm for 5 minutes, and the resulting supernatant is discarded. 

[0297] The spleen in the precipitated fraction is treated with a Tris-ammonium chloride buffer (pH 7.65) for 1 to 2 
minutes to eliminate erythrocytes and washed three times with MEM medium, and the resulting spleen cells are used 
as antibody-producing cells. 

20 (b) Preparation of myeloma cells 

[0298] As myeloma cells, an established cell line obtained from mouse or rat is used. Examples of useful cell lines 
include those derived from a mouse, such as P3-X63Ag8-U1 (hereinafter referred to as "P3-U1 ") (Curr. Topics in Micro- 
biol. Immunol, 81: 1 (1978); Europ. J. Immunol., 6: 511 (1976)); SP2/0-Agl4 (SP-2) (Nature, 276: 269 (1978)): 

25 P3-X63-AgB653 (653) (J. Immunol., 123: 1548 (1 979)); P3-X63-Ag8 (X63) cell line (Nature, 256: 495 (1 975)), and the 
like, which are 8-azaguanine-resistant mouse (BALB/c) myeloma cell lines. These cell lines are subcultured in 8-aza- 
guanine medium (medium in which, to a medium obtained by adding 1.5 mmol/l glutamine, 5x10-5 mol/l 2-mercap- 
toethanol, 10 ^ig/ml gentamicin and 10% fetal calf serum (FCS) (manufactured by CSL) to RPMI-1640 medium (here- 
inafter referred to as the "normal medium"), 8-azaguanine is further added at 15 ug/ml) and cultured in the normal 

30 medium 3 or 4 days before cell fusion, and 2x10 7 or more of the cells are used for the fusion. 

(c) Production of hybridoma 

[0299] The antibody-producing cells obtained in (a) and the myeloma cells obtained in (b) are washed with MEM 
medium or PBS (disodium hydrogen phosphate: 1 .83 g, sodium dihydrogen phosphate: 0.21 g, sodium chloride: 7.65 
g ; distilled water: 1 liter pH: 7.2) and mixed to give a ratio of antibody-producing cells : myeloma cells = 5 : 1 to 10 : 
1 , followed by centrifugation at 1 ,200 rpm for 5 minutes, and the supernatant is discarded. 

[0300] The cells in the resulting precipitated fraction were thoroughly loosened, 0.2 to 1 ml of a mixed solution of 2 
g of polyethylene glycol-1000 (PEG-1000), 2 ml of MEM medium and 0.7 ml of dimethylsulfoxide (DMSO) per 10* 
antibody-producing cells is added to the cells under stirring at 37°C : and then 1 to 2 ml of MEM medium is further 
added thereto several times at 1 to 2 minute intervals. 

[0301] After the addition, MEM medium is added to give a total amount of 50 ml. The resulting prepared solution is 
centrifuged at 900 rpm for 5 minutes, and then the supernatant is discarded. The cells in the resulting precipitated 
fraction were gently loosened and then gently suspended in 1 00 ml of HAT medium (the normal medium to which 1 0' 4 
mol/l hypoxanthine, 1.5X10- 5 mol/l thymidine and 4X10 7 mol/l aminopterin have been added) by repeated drawing 
up into and discharging from a measuring pipette. 

[0302] The suspension is poured into a 96 well culture plate at 100 jjJ/well and cultured at 37°C for 7 to 14 days in 
a 5% C0 2 incubator. 

[0303] After culturing, a part of the culture supernatant is recovered, and a hybridoma which specifically reacts with 
a partial fragment polypeptide of the polypeptide of the present invention is selected according to the enzyme immu- 
noassay described in Antibodies, A Laboratory manual, Cold Spring Harbor Laboratory, Chapter 14 (1998) and the like. 
[0304] A specific example of the enzyme immunoassay is described below. 

[0305] The partial fragment polypeptide of the polypeptide of the present invention used as the antigen in the immu- 
nization is spread on a suitable plate, is allowed to react with a hybridoma culturing supernatant or a purified antibody 
obtained in (d) described below as a first antibody, and is further allowed to react with an anti-rat or anti-mouse immu- 
noglobulin antibody labeled with an enzyme, a chemical luminous substance, a radioactive substance or the like as a 
second antibody for reaction suitable for the labeled substance. A hybridoma which specifically reacts with the polypep- 
tide of the present invention is selected as a hybridoma capable of producing a monoclonal antibody of the present 



35 



40 



45 



50 



55 



29 

BNSDOCID: <EP 1 1G879QA2 I. > 



EP 1 108 790 A2 



invention. 

[0306] Cloning is repeated using the hybridoma twice by limiting dilution analysis (HT medium (a medium in which 
aminopterin has been removed from HAT medium) is firstly used, and the normal medium is secondly used), and a 
hybridoma which is stable and contains a sufficient amount of antibody titer is selected as a hybridoma capable of 
5 producing a monoclonal antibody of the present invention. 

(d) Preparation of monoclonal antibody 

[0307] The monoclonal antibody-producing hybridoma cells obtained in (c) are injected intraperitoneally into 8- to 
10 10-week-old mice or nude mice treated with pristane (intraperitoneal administration of 0.5 ml of 2,6,1 0,1 4-tetrameth- 
ylpentadecane (pristane), followed by 2 weeks of feeding) at 5x10^ to 20x10 s cells/animal. The hybridoma causes 
ascites tumor in 10 to 21 days. 

[0308] The ascitic fluid is collected from the mice or nude mice, and centrifuged to remove solid contents at 3000 
rpm for 5 minutes. 

15 [0309] A monoclonal antibody can be purified and isolated from the resulting supernatant according to the method 
similar to that used in the polyclonal antibody. 

[0310] The subclass of the antibody can be determined using a mouse monoclonal antibody typing kit or a rat mon- 
oclonal antibody typing kit. The polypeptide amount can be determined by the Lowry method or by calculation based 
on the absorbance at 280 nm. 

20 [0311] The antibody obtained in the above is within the scope of the antibody of the present invention. 

[0312] The antibody can be used for the general assay using an antibody, such as a radioactive material labeled 
immunoassay (RIA), competitive binding assay, an immunotissue chemical staining method (ABC method, CSA meth- 
od, etc.), immunoprecipitation, Western blotting, ELISA assay, and the like (An introduction to Radioimmunoassay and 
Related Techniques, Elsevier Science (1986); Techniques in Immunocytochemistry Academic Press, Vol. 1 (1982), 

25 Vol 2 (1 983) & Vol. 3 (1 985); Practice and Theory of Enzyme Immunoassays, Elsevier Science (1 985); Enzyme-linked 
Immunosorbent Assay (ELISA), Igaku Shoin (1 976) ; Antibodies - A Laboratory Manual, Cold Spring Harbor laboratory 
(1 988) ; Monoclonal Antibody Experiment Manual, Kodansha Scientific ( 1 987) ; Second Series Biochemical Experiment 
Course, Vol. 5, Immunobiochemistry Research Method, Tokyo Kagaku Dojin (1986)). 
[0313] The antibody of the present invention can be used as it is or after being labeled with a label. 

30 [0314] Examples of the label include radioisotope, an affinity label (e.g., biotin, avidin, orthe like), an enzyme label 
(e g horseradish peroxidase, alkaline phosphatase, or the like), a fluorescence label (e.g., FITC, rhodamine, or the 
like),' a label using a rhodamine atom, U Histochem. Cytochem., 18: 315 (1970); Meth. Enzym., 62: 308 (1979); Im- 
munol., 109: 129 (1972); J. Immunol., Meth., 13: 215 (1979)), and the like. 

[031 5] Expression of the polypeptide of the present invention, fluctuation of the expression, the presence or absence 
35 of structural change of the polypeptide, and the presence or absence in an organism other than coryneform bacteria 
of a polypeptide corresponding to the polypeptide can be analyzed using the antibody or the labeled antibody by the 
above assay, or a polypeptide array or proteome analysis described below. 

[0316] Furthermore, the polypeptide recognized by the antibody can be purified by immunoaffinity chromatography 
using the antibody of the present invention. 



40 



12. Production and use of polypeptide array 
(1) Production of polypeptide array 

45 [0317] A polypeptide array can be produced using the polypeptide of the present invention obtained in the above 
item 10 or the antibody of the present invention obtained in the above item 11 . 

[0318] The polypeptide array of the present invention includes protein chips, and comprises a solid support and the 
polypeptide or antibody of the present invention adhered to the surface of the solid support. 

[0319] Examples of the solid support include plastic such as polycarbonate or the like; an acrylic resin, such as 
so polyacrylamide or the like; complex carbohydrates, such as agarose, sepharose, or the like; silica; a silica-based ma- 
terial, carbon, a metal, inorganic glass, latex beads, and the like. 

[0320] The polypeptides or antibodies according to the present invention can be adhered to the surface of the solid 
support according to the method described in Biotechniques, 27: 1 258-61 (1 999); Molecular Medicine Today, 5: 326-7 
(1999); Handbook of Experimental Immunology, 4th edition, Blackwell Scientific Publications, Chapter 10 (1986); Meth. 
55 Enzym., 34 (1974); Advances in Experimental Medicine and Biology, 42 (1974); U.S. Patent 4,681 ,870; U.S. Patent 
4,282,287; U .S. Patent 4,762,881 , or the like. 

[0321] The analysis described herein can be efficiently performed by adhering the polypeptide or antibody of the 
present invention to the solid support at a high density, though a high fixation density is not always necessary. 
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(2) Use of polypeptide array 

[0322] A polypeptide or a compound capable of binding to and interacting with the polypeptides of the present in- 
vention adhered to the array can be identified using the polypeptide array to which the polypeptides of the present 
5 invention have been adhered thereto as described in the above (1). 

[0323] Specifically, a polypeptide or a compound capable of binding to and interacting with the polypeptides of the 
present invention can be identified by subjecting the polypeptides of the present invention to the following steps (i) to (iv): 

(i) preparing a polypeptide array having the polypeptide of the present invention adhered thereto by the method 
10 of the above (1); 

(ii) incubating the polypeptide immobilized on the polypeptide array together with at least one of a second polypep- 
tide or compound; 

(iii) detecting any complex formed between the at least one of a second polypeptide or compound and the polypep- 
tide immobilized on the array using, for example, a label bound to the at least one of a second polypeptide or 

15 compound, or a secondary label which specifically binds to the complex or to a component of the complex after 

unbound material has been removed; and 

(iv) analyzing the detection data. 

[0324] Specific examples of the polypeptide array to which the polypeptide of the present invention has been adhered 
20 include a polypeptide array containing a solid support to which at least one of a polypeptide containing an amino acid 
sequence selected from SEQ ID NOS:3502 to 7001, a polypeptide containing an amino acid sequence in which at 
least one amino acids is deleted, replaced, inserted or added in the amino acid sequence of the polypeptide and having 
substantially the same activity as that of the polypeptide, a polypeptide containing an amino acid sequence having a 
homology of 60% or more with the amino acid sequences of the polypeptide and having substantially the same activity 
25 as that of the polypeptides, a partial fragment polypeptide, and a peptide comprising an amino acid sequence of a part 
of a polypeptide. 

[0325] The amount of production of a polypeptide derived from coryneform bacteria can be analyzed using a polypep- 
tide array to which the antibody of the present invention has been adhered in the above (1). 

[0326] Specifically, the expression amount of a gene derived from a mutant of coryneform bacteria can be analyzed 
30 by subjecting the gene to the following steps (i) to (iv): 

(i) preparing a polypeptide array by the method of the above (1); 

(ii) incubating the polypeptide array (the first antibody) together with a polypeptide derived from a mutant of co- 
ryneform bacteria; 

35 (iii) detecting the polypeptide bound to the polypeptide immobilized on the array using a labeled second antibody 

of the present invention; and 
(iv) analyzing the detection data. 

[0327] Specific examples of the polypeptide array to which the antibody of the present invention is adhered include 
40 a polypeptide array comprising a solid support to which at least one of an antibody which recognizes a polypeptide 
comprising an amino acid sequence selected from SEQ ID NOS:3502 to 7001, a polypeptide comprising an amino 
acid sequence in which at least one amino acids is deleted, replaced, inserted or added in the amino acid sequence 
of the polypeptide and having substantially the same activity as that of the polypeptide, a polypeptide comprising an 
amino acid sequence having a homology of 60% or more with the amino acid sequences of the polypeptide and having 
45 substantially the same activity as that of the polypeptides, a partial fragment polypeptide, or a peptide comprising an 
amino acid sequence of a part of a polypeptide. 

[0328] A fluctuation in an expression amount of a specific polypeptide can be monilored using a polypeptide obtained 
in the time course of culture as the polypeptide derived from coryneform bacteria. The culturing conditions can be 
optimized by analyzing the fluctuation. 
50 [0329] When a polypeptide derived from a mutant of coryneform bacteria is used, a mutated polypeptide can be 
detected. 

13. Identification of useful mutation in mutant by proteome analysis 

55 [0330] Usually, the proteome is used herein to refer to a method wherein a polypeptide is separated by twodimen- 
sional electrophoresis and the separated polypeptide is digested with an enzyme, followed by identification of the 
polypeptide using a mass spectrometer (MS) and searching a data base. 

[0331] The two dimensional electrophoresis means an electrophoretic method which is performed by combining two 
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electrophoretic procedures having different principles. For example, polypeptides are separated depending on molec- 
ular weight in the primary electrophoresis. Next, the gel is rotated by 90° or 180° and the secondary electrophoresis 
is carried out depending on isoelectric point. Thus, various separation patterns can be achieved (JIS K 3600 2474). 
[0332] In searching the data base, the amino acid sequence information of the polypeptides of the present invention 
5 and the recording medium of the present invention provide for in the above items 2 and 8 can be used. 

[0333] The proteome analysis of a coryneform bacterium and its mutant makes it possible to identify a polypeptide 
showing a fluctuation therebetween. 

[0334] The proteome analysis of a wild type strain of coryneform bacteria and a production strain showing an im- 
proved productivity of a target product makes it possible to efficiently identify a mutation protein which is useful in 

10 breeding for improving the productivity of a target product or a protein of which expression amount is fluctuated. 

[0335] Specifically, a wild type strain of coryneform bacteria and a lysine-producing strain thereof are each subjected 
to the proteome analysis. Then, a spot increased in the lysine-producing strain, compared with the wild type strain, is 
found and a data base is searched so that a polypeptide showing an increase in yield in accordance with an increase 
in the lysine productivity can be identified. For example, as a result of the proteome analysis on a wild type strain and 

15 a lysine-producing strain, the productivity of the catalase having the amino acid sequence represented by SEQ ID NO: 
3785 is increased in the lysine-producing mutant. 

[0336] As a result that a protein having a high expression level Is identified by proteome analysis using the nucleotide 
sequence information and the amino acid sequence information, of the genome of the coryneform bacteria of the 
present invention, and a recording medium storing the sequences, the nucleotide sequence of the gene encoding this 
20 protein and the nucleotide sequence in the upstream thereof can be searched at the same time, and thus, a nucleotide 
sequence having a high expression promoter can be efficiently selected. 

[0337] In the proteome analysis, a spot on the two-dimentional electrophoresis gel showing a fluctuation is sometimes 
derived from a modified protein. However, the modified protein can be efficiently identified using the recording medium 
storing the nucleotide sequence information, the amino acid sequence information, of the genome of coryneform bac- 

25 teria, and the recording medium storing the sequences, according to the present invention. 

[0338] Moreover, a useful mutation point in a useful mutant can be easily specified by searching a nucleotide se- 
quence (nucleotide sequence of promoters, ORF, or the like) relating to the thus identified protein using a recording 
medium storing the nucleotide sequence information and the amino acid sequence information, of the genome of 
coryneform bacteria of the present invention, and a recording medium storing the sequences and using a primer de- 

30 signed on the basis of the detected nucleotide sequence. As a result that the useful mutation point is specified, an 
industrially useful mutant having the useful mutation or other useful mutation derived therefrom can be easily bred. 
[0339] The present invention will be explained in detail below based on Examples. However, the present invention 
is not limited thereto. 

35 Example 1 

Determination of the full nucleotide sequence of genome of Corynebacterium glutamicum 

[0340] The full nucleotide sequence of the genome of Corynebacterium glutamicum was determined based on the 
40 whole genome shotgun method {Science, 269: 496-512 (1995)). In this method, a genome library was prepared and 
the terminal sequences were determined at random. Subsequently, these sequences were ligated on a computer to 
cover the full genome. Specifically, the following procedure was carried out. 

(1) Preparation of genome DNA of Corynebacterium glutamicum ATCC 13032 

45 

[0341] Corynebacterium glutamicum ATCC 1 3032 was cultured in BY medium (7 g/l meat extract, 1 0 g/l peptone, 3 
g/f sodium chloride, 5 g/l yeast extract, pH 7.2) containing 1 % of glycine at 30°C overnight and the cells were collected 
by centrifugation. After washing with STE buffer (10.3% sucrose, 25 mmol/l Tris hydrochloride, 25 mmol/l EDTA, pH 
8.0), the cells were suspended in 10 ml of STE buffer containing 10 mg/ml lysozyme, followed by gently shaking at 

so 37°C for 1 hour. Then, 2 ml of 1 0% SDS was added thereto to lyse the cells, and the resultant mixture was maintained 
at 65°C for 1 0 minutes and then cooled to room temperature. Then, 1 0 ml of Tris-neutralized phenol was added thereto, 
followed by gently shaking at room temperature for 30 minutes and centrifugation (1 5,000 x g, 20 minutes, 20°C). The 
aqueous layer was separated and subjected to extraction with phenol/chloroform and extraction with chloroform (twice) 
in the same manner. To the aqueous layer, 3 mol/l sodium acetate solution (pH 5.2) and isopropanol were added at 

55 1/10 times volume and twice volume, respectively, followed by gently stirring to precipitate the genome DNA. The 
genome DNA was dissolved again in 3 ml of TE buffer (1 0 mmol/l Tris hydrochloride, 1 mmol/l EDTA, pH 8.0) containing 
0.02 mg/ml of RNase and maintained at 37°C for 45 minutes. The extractions with phenol, phenol/chloroform and 
chloroform were carried out successively in the same manner as the above. The genome DNA was subjected to iso- 
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propanol precipitation. The thus formed genome DNA precipitate was washed with 70% ethanol three times, followed 
by air-drying, and dissolved in 1 .25 ml of TE buffer to give a genome DNA solution (concentration: 0.1 mg/ml). 

(2) Construction of a shotgun library 

5 

[0342] TE buffer was added to 0.01 mg of the thus prepared genome DNA of Cory nebacte hum glutamicum ATCC 
13032 to give a total volume of 0.4 ml, and the mixture was treated with a sonicator (Yamato Powersonic Model 150) 
at an output of 20 continuously for 5 seconds to obtain fragments of 1 to 10 kb. The genome fragments were blunt- 
ended using a DNA blunting kit (manufactured by Takara Shuzo) and then fractionated by 6% polyacrylamide gel 

10 electrophoresis. Genome fragments of 1 to 2 kb were cut out from the gel, and 0.3 ml MG elution buffer (0.5 mol/l 
ammonium acetate, 1 0 mmol/l magnesium acetate, 1 mmol/l EDTA, 0. 1 % SDS) was added thereto, followed by shaking 
at 37°C overnight to elute DNA. The DNA eluate was treated with phenol/chloroform, and then precipitated with ethanol 
to obtain a genome library insert. The total insert and 500 ng of pUC1 8 Smal/B AP (manufactured by Amersham Phar- 
macia Biotech) were ligated at 1 6°C for 40 hours. 

15 [0343] The ligation product was precipitated with ethanol and dissolved in 0.01 ml of TE buffer. The ligation solution 
(0.001 ml) was introduced into 0.04 ml of E. coli ELECTRO MAX DH10B (manufactured by Life Technologies) by the 
electroporation under conditions according to the manufacture's instructions. The mixture was spread on LB plate 
medium (LB medium (10 g/l bactotrypton, 5 g/l yeast extract, 10 g/l sodium chloride, pH 7.0) containing 1.6% of agar) 
containing 0. 1 mg/ml ampicillin, 0.1 mg/ml X-gal and 1 mmol/l isopropyl-p-D-thiogafactopyranoside (IPTG) and cultured 

20 at 37°C overnight. 

[0344] The transformant obtained from colonies formed on the plate medium was stationarily cultured in a 96-well 
titer plate having 0.05 ml of LB medium containing 0.1 mg/ml ampicillin at 37°C overnight. Then, 0.05 ml of LB medium 
containing 20% glycerol was added thereto, followed by stirring to obtain a glycerol stock. 

25 (3) Construction of cosmid library 

[0345] About 0.1 mg of the genome DNA of Corynebacterium glutamicum ATCC 13032 was partially digested with 
Sau3A\ (manufactured by Takara Shuzo) and then ultracentrifuged (26,000 rpm, 18 hours, 20°C) under 10 to 40% 
sucrose density gradient obtained using 10% and 40% sucrose buffers (1 mol/l NaCI, 20 mmol/l Tris hydrochloride, 5 

30 mmol/l EDTA, 1 0% or 40% sucrose, pH 8.0). After the centrifugation, the solution thus separated was fractionated into 
tubes at 1 ml in each tube. After confirming the DNA fragment length of each fraction by agarose gel electrophoresis, 
a fraction containing a large amount of DNA fragment of about 40 kb was precipitated with ethanol. 
[0346] The DNA fragment was ligated to the BamHI site of superCosI (manufactured by Stratagene) in accordance 
with the manufacture's instructions. The ligation product was incorporated into Escherichia co// XL-1-BlueMR strain 

35 (manufactured by Stratagene) using Gigapack III Gold Packaging Extract (manufactured by Stratagene) in accordance 
with the manufacture's instructions. The Escherichia coli was spread on LB plate medium containing 0.1 mg/ml amp- 
icillin and cultured therein at 37°C overnight to isolate colonies. The resulting colonies were stationarily cultured at 
37°C overnight in a 96-well titer plate containing 0.05 ml of the LB medium containing 0.1 mg/ml ampicillin in each 
well. LB medium containing 20% glycerol (0.05 ml) was added thereto, followed by stirring to obtain a glycerol stock. 

(4) Determination of nucleotide sequence 

(4-1) Preparation of template 

^5 [0347] The full nucleotide sequence of Corynebacterium glutamicum ATCC 1 3032 was determined mainly based on 
the whole genome shotgun method. The template used in the whole genome shotgun method was prepared by the 
PCR method using the library prepared in the above (2). 

[0348] Specifically, the clone derived from the whole genome shotgun library was inoculated using a replicator (man- 
ufactured by GENETIX) into each well of a 96-well plate containing the LB medium containing 0.1 mg/ml of ampicillin 

50 at 0.08 ml per each well and then stationarily cultured at 37°C overnight. 

[0349] Next, the culturing solution was transported using a copy plate (manufactured by Tokken) into a 96-well re- 
action plate (manufactured by PE Biosystems) containing a PCR reaction solution (TaKaRa Ex Taq (manufactured by 
Takara Shuzo)) at 0.08 ml per each well. Then, PCR was carried out in accordance with the protocol by Makino et at. 
(DNA Research, 5: 1 -9 (1998)) using GeneAmp PCR System 9700 (manufactured by PE Biosystems) to amplify the 

55 inserted fragment. 

[0350] The excessive primers and nucleotides were eliminated using a kit for purifying a PCR production (manufac- 
tured by Amersham Pharmacia Biotech) and the residue was used as the template in the sequencing reaction. 
[0351] Some nucleotide sequences were determined using a double-stranded DNA plasmid as a template. 
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[0352] The double-stranded DNA plasmid as the template was obtained by the following method. 
[0353] The clone derived from the whole genome shotgun library was inoculated into a 24- or 96-well plate containing 
a 2x YT medium (1 6 g/l bactotrypton, 1 0 g/l yeast extract, 5 g/i sodium chloride, pH 7.0) containing 0.05 mg/ml ampicillin 
at 1 .5 ml per each well and then cultured under shaking at 37°C overnight. 
5 [0354] The double-stranded DNA plasmid was prepared from the culturing solution using an automatic plasmid pre- 
paring machine, KURABO PI-50 (manufactured by Kurabo Industries) or a multiscreen (manufactured by Millipore) in 
accordance with the protocol provided by the manufacturer. 

[0355] To purify the double-stranded DNA plasmid using the multiscreen, Biomek 2000 (manufactured by Beckman 
Coulter) or the like was employed. 
10 [0356] The thus obtained double-stranded DNA plasmid was dissolved in water to give a concentration of about 0.1 
mg/ml and used as the template in sequencing. 

(4-2) Sequencing reaction 

is [0357] To 6 \i\ of a solution of ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured 
by PE Biosystems), an M13 regular direction primer (M13-21) or an M13 reverse direction primer (M13REV) (DNA 
Research, 5: 1-9 (1 998) and the template prepared in the above (4-1) (the PCR product or the plasmid) were added 
to give 10 uf of a sequencing reaction solution. The primers and the templates were used in an amount of 1.6 pmol 
and an amount of 50 to 200 ng, respectively 

20 [0358] Dye terminator sequencing reaction of 45 cycles was carried out with GeneAmp PCR System 9700 (manu- 
factured by PE Biosystems) using the reaction solution. The cycle parameter was determined in accordance with the 
manufacturer's instruction accompanying ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit. The 
sample was purified using Multiscreen HV plate (manufactured by Millipore) according to the manufacture's instruc- 
tions. The thus purified reaction product was precipitated with ethanol, followed by drying, and then stored in the dark 

25 at-30°C. 

[0359] The dry reaction product was analyzed by ABI PRISM 377 DNA Sequencer and ABI PRISM 3700 DNA An- 
alyzer (both manufactured by PE Biosystems) each in accordance with the manufacture's instructions. 
[0360] The data of about 50,000 sequences in total (i.e., about 42,000 sequences obtained using 377 DNA Sequenc- 
er and about 8,000 reactions obtained by 3700 DNA Analyser) were transferred to a server (Alpha Server 41 00: man- 
30 ufactured by COMPAQ) and stored. The data of these about 50,000 sequences corresponded to 6 times as much as 
the genome size. 

(5) Assembly 

35 [0361] All operations were carried out on the basis of UNIX platform. The analytical data were output in Macintosh 
platform using X Window System. The base call was carried out using phred (The University of Washington). The 
vector sequence data was deleted using SPS Cross_Match (manufactured by Southwest Parallel Software). The as- 
sembly was carried out using SPS phrap (manufactured by Southwest Parallel Software; a high-speed version of phrap 
(The University of Washington)). The contig obtained by the assembly was analyzed using a graphical editor, consed 

40 (The University of Washington). A series of the operations from the base call to the assembly were carried out simul- 
taneously using a script phredPhrap attached to consed. 

(6) Determination of nucleotide sequence in gap part 

45 [0362] Each cosmid in the cosmid library constructed in the above (3) was prepared by a method similar to the 
preparation of the double-stranded DNA plasmid described in the above (4-1). The nucleotide sequence at the end of 
the inserted fragment of the cosmid was determined by using ABI PRISM BigDye Terminator Cycle Sequencing Ready 
Reaction Kit (manufactured by PE Biosystems) according to the manufacture's instructions. 

[0363] About 800 cosmid clones were sequenced at both ends to search a nucleotide sequence in the contig derived 
so from the shotgun sequencing obtained in the above (5) coincident with the sequence. Thus, the linkage between re- 
spective cosmid clones and respective contigs were determined and mutual alignment was carried out. Furthermore, 
the results were compared with the physical map of Corynebacterium glutamicum ATCC 13032 (Mot. Gen. Genet, 
252\ 255-265 (1 996) to carrying out mapping between the cosmids and the contigs. 

[0364] The sequence in the region which was not covered with the contigs was determined by the following method. 
55 [0365] Clones containing sequences positioned at the ends of contigs were selected. Among these clones, about 
1 ,000 clones wherein only one end of the inserted fragment had been determined were selected and the sequence at 
the opposite end of the inserted fragment was determined. A shotgun library clone or a cosmid clone containing the 
sequences at the respective ends of the inserted fragment in two contigs was identified, the full nucleotide sequence 
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of the inserted fragment of this clone was determined, and thus the nucleotide sequence of the gap part was determined. 
When no shotgun library clone or cosmid clone covering the gap part was available, primers complementary to the 
end sequences at the two contigs were prepared and the DNA fragment in the gap part was amplified by PCR. Then., 
sequencing was performed by the primer walking method using the amplified DNA fragment as a template or by the 
shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment was determined. 
Thus, the nucleotide sequence of the domain was determined. 

[0366] In a region showing a low sequence precision, primers were synthesized using AUTOFINISH function and 
NAVIGATING function of consed (The University of Washington) and the sequence was determined by the primer 
walking method to improve the sequence precision. The thus determined full nucleotide sequence of the genome of 
Corynebacterium glutamicum ATCC 13032 strain is shown in SEQ ID NO:1. 



(7) Identification of ORF and presumption of its function 



[0367] ORFs in the nucleotide sequence represented by SEQ ID NO:1 were identified according to the following 
15 method. First, the ORF regions were determined using software for identifying ORF, i.e., Glimmer, GeneMark and 
GeneMark.hmm on UNIX platform according to the respective manual attached to the software. 
[0368] Based on the data thus obtained, ORFs in the nucleotide sequence represented by SEQ ID NO:1 were iden- 
tified. 

[0369] The putative function of an ORF was determined by searching the homology of the identified amino acid 
20 sequence of the ORF against an amino acid database consisting of protein-encoding domains derived from Swiss- 
Prot, PIR or Genpept database constituted by protein encoding domains derived from GenBank database : Frame 
Search (manufactured by Compugen), or by searching the homology of the identified amino acid sequence of the ORF 
against an amino acid database consisting of protein-encoding domains derived from Swiss-Prot, PIR or Genpept 
database constituted by protein encoding domains derived from GenBank database, BLAST. The nucleotide sequences 
25 of the thus determined ORFs are shown in SEQ ID NOS:2 to 3501 , and the amino acid sequences encoded by these 
ORFs are shown in SEQ ID NOS:3502 to 7001 . 

[0370] In some cases of the sequence listings in the present invention, nucleotide sequences, such as TTG, TGT, 
GGT, and the like, other than ATG, are read as an initiating codon encoding Met. 

[0371] Also, the preferred nucleotide sequences are SEQ ID NOS:2 to 355 and 357 to 3501 , and the preferred amino 

30 acid sequences are shown in SEQ ID NOS:3502 to 3855 and 3857 to 7001 

[0372] Table 1 shows the registration numbers in the above-described databases of sequences which were judged 
as having the highest homology with the nucleotide sequences of the ORFs as the results of the homology search in 
the amino acid sequences using the homology-searching software Frame Search (manufactured by Compugen), 
names of the genes of these sequences, the functions of the genes, and the matched length, identities and analogies 

35 compared with publicly known amino acid translation sequences. Moreover, the corresponding positions were con- 
firmed via the alignment of the nucleotide sequence of an arbitrary ORF with the nucleotide sequence of SEQ ID NO: 
1 . Also, the positions of nucleotide sequences other than the ORFs (for example, ribosomal RNA genes, transfer RNA 
genes, IS sequences, and the like) on the genome were determined. 

[0373] Fig. 1 shows the positions of typical genes of the Corynebacterium glutamicum ATCC 1 3032 on the genome. 
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thtoredoxin peroxidase 






hypothetical membrane protein 


oxidoreductase or thiamin 
biosynthesis protein 










chymotrypsin Btl 


arsenate reductase (arsenical pump 
modifier) 


hypothetical membrane protein 


hypothetical protein ' 


hypothetical protein 


GTP-binding protein {tyrosine 
phsphorylated protein A) 


hypothetical protein ! 


hypothetical protein 




CO 
rr 

CU 

LL 
rr 

X 

o 

XJ 
CD 


15 




Matched 
length 
(aa) 


CO 

co 


rr 
CO 






CO 

ro 


CN 
CO 
CM 











CN 


r— 


c 

rr 

CO 




CN 
CN 


rr 
CO 


909 


to 

CO 




CO 

o 


20 




Similarity 
(%) 


58.7 


81.7 






72.0 


O 
CD 
rr 










! 51.3 j 


72.1 


62.4 


rr 


62.9 


76.7 


Ol 
rf 

in 


61.9 j 




91.3 






Identity 

(%) 


35.5 


57.3 






39.9 


34.0 










28.8 


43.2 


23.5 


43.5 


35.8 


CO 

CO 
rr 


27.9 


38.7 




78.6 


25 
30 
35 
40 


Table 1 (continued) 


Homologojs gene 


Thermus thermophilus unrA 


Mycobacterium tuberculosis 
H37Rvtpx 






j Escherichia coli yedL 


Streptomyces coeficclor A3(2) 






i 




Penaeus vannamei 


Escherichia coli 


Bacillus subtilis yyaD ! 


Mycobacterium tuberculosis 
H37Rv Rv1632c 


Mycobacterium tuberculosis 
H37Rv Rv1157c 


Escherichia coli K12typA 


Mycobacterium tuberculosis 
H37Rv Rv1166 


Mycobacterium tuberculosis 
H37Rv Rv1170 




Streptomyces griseus fer 




db Match 


X 

i — 

UJ 

X 

h~ 

«' 

or 
> 

Z> 

CL 


_3 
h- 
o 
> 

i 

X 

CL 
h- 

CL 
to 






o 
o 

LU 

1 

o 

UJ 

>- 
id 
</) 


CM 

I 

CO 
r- 
U. 
O 
CO 
CL 

cn 






i 
i 


« 


< 
> 
z 
in 

LL 
cm' 

or 
i— 
o 

Cl 

in 


_j 

o 

O 
UJ 

i 

CM 
O 

or 
< 

CL 
1/1 


r> 

CO 

o 
< 

CD 

< 
> 
> 
Cl 


CD 
HO 

in 
o 

r-— 

LL 
Cl 


co 
to 
tn 
o 

u_ 

CL 


_j 
o 
o 

LU 

1 

< 

CL 

>- 

1— 
Q_ 
lO 


rr 
r-- 

00 

O 
r- 

UL 

Q. 


m 

co 
o 

CO 

Q. 




sp FER_STRGR 






Li_ • — > 


O 
rr 
m 
CM 


in 

cn 


CO 
CM 


1776 


rr 

LO 

a> 


o 
o 
cn 


CO 
CO 

CO 


r- 

cr> 

CN 


CO 

CM 


co 

ro 


rr 

CO 
CO 


to 

rr 

CO 


1200| 


r»- 
rO 
LO 


rr 


1911 J 


1 506 J 


O 

r- 

CO 


CO 
CO 

rr 


cnl 

CO 


45 




Terminal 
(nt) 


1132133 


1135055 


1 1135691 
j 


J 1135058 


1136938 


1138859 


1139245 


«.l 

cn 
rr 
cn 

CO 


I 1139617 


1139635 


1140028 


1140901 


1142472 


CD 
r- 
rr 

CM 
rr 


1143026 


1146028 


1147602 


1148461 


1148002 


1149267 


50 




initial 
(nt) 


C\t 

rr 
rr 
CO 


CO 

CO 

rr 
CO 


1 135476 


1 136833 


1 137891 


1 137960 


1 138880 


1139196 


1 139357 


CM 

o 
o 

rr 


1140861 


in 

V 
CM 

rr 


1 141273 


1143015 


CD 

CO 

CO 
rr 


1144118 


1146097 


CM 
CO 
UO 

r- 
rx 


m 

rr 
rr 
CO 

rr 


1148953 








4706 


o 

rr 


co 
o 

rr 


cn 
o 
r- 
■cr 


o 

rr 


r-. 

rr 


4712 


4713 


rj 

r- 
rr 


in 

r~- 
rr 


'4716 


r^. 
r - 


CO 

r- 


CD 

r- 
rr 


O 

CN 
rr 


4721 


CM 

cn 
rr 


4723 


TT 
CM 

rr 


4725 J 


55 




ujOz 
cn Z a 


CO 

cm 


£ 1 

§ ! 


1208 


1209 


1210 


1211 


1212 


| 1213 


rr 

CM 




LULU 


1217 
1218 


1219 


1220 


1221 


CM 
CN 

cm | 
" I 


co 

CN 
CN 


rr 

CM 
CM 


«n I 

CM 1 

= 1 



BNSOOCID: <EP 1108790A2J > 



99 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



C CO co 
TOO)—' 



1=- 



o 

E 
o 
X 



o 

OJ 

in 

rc , 
>,co 
E CN~ 

a <L 

= c 

<D C 

Z O- 
o 

** o 



9- c ^ 
x> .2 ro 
^ ro <^ 
> o 

-e 

^5-5 



o 
T3 



O 



O 

o 



to 



O 



3 



St 



o 
o 



t to 
.2 "o 

CD CN 

O h- 
CJ < 



o 

CD 
ID 



O 
CD 



<T3 






> 


O 






c 






0) 






E 




CO 


ro 






o 


















o 






c 






o 


10 


-Q 


c 


CO 


h- 


0) 


o 


cz 


> 




<L> 


OJ 


o 


CT 


CL 


a> 








C 
CD 


5 


Z3 



2 £ 



to 



c 
o 
E 



• — CO 

e 

*x o 



o 

u c 

0>g> 

0- a 



CD 
CO 
CN 



4) 

ro 

jz £ 

CL 2 

to q) 

5 -a 

O c 

u o> 

C7> ro 



cn 



ECN 
_ t 

.2 o 



5 2 

2" (J 
o h- 



CM 




CO 




< 








o 




o 




_o 




~a3 




o 




o 




iS> 




QJ 




U 








£ 




o 




o. 


CO 


or 


hp 



< 
o 
o 



o 
o 

CD 
CO 



cn 
CO 



So "| 
c/5 z » 



55 



co 

CD 



o 

CD 
CD 



CL 

u 

CO 



«i > 

o a: 

to . 
x» > 

o o: 

o r- 
^ CO 
S X 



e — t; 

O i > I 
CL «0 ^3 i 

< ^ 

Sag 



CO 



(O 




T> 




X> 












O 




cu 








O) 




ro 




o 




CL 




!/> 




o 




c: 




o 




E 




o 


< 


o 


>> 




E 



CD 
O 



< 

> 



o 
o 

LU 

I 

< 

o 

id. 



CN 
CD 



CN 



CN- 




CO 




< 








o 




a 




<_> 




a; 




o 




o 




LTl 
















E 




o 




epi 


u 






CO 


CJ) 



CN 
O 



CM 
CN 



CO 
CN 



CO 
CN 



CO 

cn 



CD 



O 

o 



o 
CD 
i 

CD 



t 

2 o 
& E 

CO "~ 



LU 



cr 

co 

I 

a 



to 1 



o 
u 



LU 

o 

CL 

cr 



CD 
CD 



CN I 
T ' 

r- : 



100 

BNSDOCID: <EP 1108790A2_I. > 



EP1 108 790 A2 



5 



10 



15 



20 



25 



35 



40 



45 



55 



Function 


hypothetical protein 


ATPase 


I 

hypothetical protein 


hypothetical protein 


hypothetical protein 






2-oxoglutarate dehydrogenase 


ABC transporter or multidrug 
resistance protein 2 (P-glycoprotein 
2) 


hypothetical protein 


shikimate dehydrogenase 


para-nitrobenzyl esterase 








tetracycline resistance protein 


metabolite export pump of 
tetracenomycin C resistance 




Matched 
length 

(a a) , 




to 

CN 


XT 

in 


CO 
T 


o 

XT 

T— 






1257 


1288 


o 
xr 

OJ 


in 
in 

OJ 


o 
m 








cn 
o 

xr 


XT 

xr 

XT 




Similarity 1 
(%) 


73.2 


72.0 


83.8 


77.0 


87.1 






99.8 


60.4 


72.1 


61.2 


64.7 








61.4 


CN 
XT 
CD 


! 


Identity 

(%) ' 


45.5 


43.6 


60,4 ! 


CO 

oo 

T 


57.9 

i 






99.4 


28,8 


31.7 

i 


25.5 


35.7 








27.1 


32.4 




Homologous gene 


Mycobacterium tuberculosis 
H37Rv Rv1224 


Escherichia colt mrp 


Mycobacterium tuberculosis 
H37Rv Rv1231c 


Mycobacterium tuberculosis 
H37RvRv1232c 


Mycobacterium tuberculosis 
H37RvRv1234 






Corynebacterium glutamicum 
AJ12036 odhA 


Cricetulus griseus (Chinese 
hamster) MDR2 


Mycobacterium tuberculosis 
H37RvRv/1249c 


Escherichia coli aroE 


Bacillus subtilis pnbA 








Escherichia coli transposon 
Tn1721 telA 


Streptomyces glaucescens tcmA 




db Match 


pir:C70508 


_j 
O 
o 

LU 

1 

a. 

DC 

CL 
</) 


cn 
o 
tn 
o 
r»- 
□c 

CL 


pir:C70509 


pir;A70952 






prf:2306367A 


sp:MDR2_CRIGR 


CO 

m 
cn 
o 
r— 
X 

Q. 


o 
o 

LU 

\ 

LU 

o 

or 
< 

CL 

cn 


sp:PNBA_BACSU 








sp:TCR1 ECOLI 


sp.TCMA_STRGA 






CO 
CD 
T 


1125 


cn 

"° 


1290 


CO 
m 


CD 
CO 
CO 


XX 

cn 
in 


3771 


3741 


o- 
0- 


C 

CO 


CD 


in 

CO 


CD 
rv. 

CO 


in 

OJ 

LO 


1215 


1347 


m 
o 
r- 


srminal 
(nt) 


157577 


167587 


rv. 

XT 

ro- 
co- 
co 


159321 


171187 


171871 


71869 


172501 


176308 


180121 


180872 


CO 
O 
CD 

ro 
CO 


184257 


185155 


85218 


87039 


88389 


190526 






































*~ i 


initial 
(nt) 


1167110 


1168711 


1169325 


1170610 


1170672 


1171206 


1172462 


1176271 


CO 
xr 

o 
o 

co 


1180837 


1181675 


1181993 


1183607 


1184280 


1185742 


in 
cm 
oo 
in 

CD 


1167043 


1189822 


SEQ 

NO. 
(a a) 


4743 


«T 

r- 

*T 


4745 


CO 
XT 

r— • 

XT 


Is 


CO 

xr 

r— 

T 


cn 

xT 

r- 
xr 


4750 


4751 


CN 
LO 

r-- 

XT 


4753 


m 
xt 


in l co 

lT> ' UO 

r». rv. 
xr xr 


4757 




CD 

m 
r- 

XT 


4760 


SEQ 

NO. 
(DNA) 


CO 
CN 


XT 
XT 
CN 


m 

XX 


co 

XT 

CN 


! $ 

i CN 


CO 

xx 

CN 


cn 

xr 
OJ 


o 
m 

OJ 


„_ 

uo 

OJ 


1252 


CO 

in 

CN 


1254 


in 

m 

OJ 

• 


CO 

LO 

OJ 


rv- 

LO 


CO 

m 


1259 


1260 



BNSOOCID: <EP 110879OA2 l_> 



101 



EP 1 108 790 A2 



10 



15 

o ±: 
"5. ? 

O CO 
X) <D 

5J ? 

£ E 

QJ O 

5 E -*F 



15 



20 



CO 



-o 



25 



CM 
CN 



30 



CD 
=3 
C 

C 

o 
o 



C 
CD 



o 

CD 

_o 

o 

E 
o 
X 



35 



40 



CO 

O 



CO 



45 



E 



50 



55 



CN 
CN 
CO 
O 
CD 



Si? 

co Z 2. 



O n < 

LU y ^ 
CO Z Q 



CD 

n 



o 
< 



o 

CO 

< 



<d 



o ro 

O T3 

o o 



aj 

E 



"a c 

QJ -D 

C 3 



O CD 
O T3 

-5s X 



CO 

to 



o 

CO 
CD 



in 
in 

CD 


in 

m 

CD 


5 


r- 

ID 


CN 


On) 


5 


r— 


coli 


coli 


ro 


TO 


ichi 


ich 




QJ 



o 
in 

LO 



en 
cn 



E 

i 3 



E 

il 

E 



in 

CD 



3 ^ 



— o 

! LU O 



o 
o 

LU 

I 

a 
a 

> 
a 



CO 

CD CD 



^ CN CO 



1 CD CO 
| CN | CN 



co r— co 

CD CD CD 
CN CN ! CN 



o 

O 

UJ 

I 

Q 
Q 

>- 
U 



CD 
CD 



3 O 

"5) " 



O <v 
ro o 
-D ro 

QJ _Q 



E 

il 
i§ 

ro « 

~S 2 

II 

<u c 
ro "o 

-Q CD 
CD -Q 

H 



CD >•> 



O .CL 

t g 

c 



1 55 



o cn 

5 g 



5 2 
5=C >> 

a, -c 
ro -c 

ol 

6 2 
x :>. 

O O- 



<D 

LU >, 



O 

in 
co 
o 
cn 
< 



o 

CD 

< 

cn 



O 
O 
LU 



LU 



CO 

LO 
CD 



CM 
CN 



CD 
CN 



CD 
O 
CN 

o 

CN 



CD 
CD 



CN 
O 



CO 

co 
o 

CN 



! 



o 

CN 



CD 

CD 
CN 



> 
O 
cr 
a. 



3 

id. 



CD 
CO 



CO 
CN 



CN | 



0) 

E 



o 
in 



> 
o 



E 



ro 

CO 



< 
CO 

> 

o 

cr 
a 



^1-1 



o 

CN 



CD 
O 
CN 



ID N i 

cn 1 cn ; 



BNSDOCID: <EP 1 108790A2_I_> 



102 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



Q 

< . 



2 £ — • ! 



TO 

to 



CO 



O c 

O * 

o> C 

CO <U 

E °- 

O 0) 

E J 

o> K Q 

co g -o 

-jr, -a <u 

Si _ co 

JQ < U 

* Q ^ 



CL 

uu 



< 

LL! 

D 



9- c 

O CD 

If 

o o 

CO O 

c 
o 
E 



o O 

CO I 
X) CO 

o o 

CJ. fo 



CO 



n 



• 3 



< O 



CO 

5 

LL 

I 

m 

a. 
o 

LL 



50 



55 



Oi 
Oi 

o 

CN 



— CO 

— CN 
CO CO 



O 
_j 

u 
Si 



< 
o 

u 



< 



llj y 

to 2 



o 

CO 



O n < 

UJ § 5 i 

W 2 O 





5 




TO 


c 




a> 


5> 


o 




CL 


TO 


CO 

o 


ion 


c5 


a. 


c: 


o 


o 


v> 


CL 


c 


>■. 


TO 


XT 





o 

c\i 



£ ^ 



o cr 

> 

a cr 

=^co 
S X 



in 



1— 




If) 




< 




LU 








1 

CM 




LL 




•2. 




CO 




oL 




CO 




CN 


tn 


O 


CO 




a 


CO 





(M 
CN 



CO 



CO o 



O 
o 
to 



-O > 

o cr 

>^ ro 
^ X 



.c 
o 

CL 



CM 



CO j 



cn 

I CO i 

fNJ 1 



-— CN 

2 > 

TO 

X) > 

o cr 

o 

>>co 
2 X 



CO 
O 
> 



o 

CM 



E <u 

3 T3 



C_ X 



CO 
CD 



< 



CN 

cn 



I n 



BNSDOCID: <EP_ 1 108790A2 I > 



103 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



50 



55 




104 

BNSDOCID: <EP 1 108790A2J_> 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



5 ~ 



CO 



C 
<D 



O 

E 
o 
X 



u o 



w 

g.f • 

tn ^ C 
o c >, 

a. E o 

O — 

G O o 
E Q- -Q 



> 



a> o 

$ ^ 
a> a. 



CD 

to 



o 

CL 



E 

O) 

E 

76 



E t/) 

2 to 

>— tj- 

a> o 

ID co 



a: 
O 



55 



E 
i— ■ 



< 

en 
< ( 

x 



2° s 

co Z 2. 



LiJ ^~ 
1/5 2 Q 



CO 

rn 



< 

q: 

LU 
CO 



or 

a. 



E 



■ =t CD 

5 > 

-Q > 

o £T 

o r- 



3 



O 

>- 



Q 

CD 



O 
ID 
tD 

lO 



cn 

LO 



o 

CD 



E 2 



^ > 
2 a: 



:> x 



f s 

li 

o c 



o 

>- 



CN 

a 

o 



U") 



O 
CN 



c 

fM 
CO 



3 



CL 

o 

-O 

6 
E 



d < 



-O > 

o a: 

>* CO 
^ IE 



O 
O 
UJ 

<' 

LU 

O 



CD 
O 



o 

to a» n 

t: c 

ST 

o o <f 

^ C O 

. c a? 

- c 1/1 

03 b o 

5. = .o 



E s 8 



< 



< 

< 

i 

CM 

X 

o 



o 



o 

CL. 



o 

CD 
CO 



§5 

.2 CO 

5 > 

> 

o tr 

=^co 
2 X 



o 
>- 



o 
O 
> 

CL 



5 « 



75 
o 
75 

JZ 

CL ^ 

9- E 

?3 
2 2 

CL 3 

8 _g> 

x> a) 
c o 
d ro 



CN 
CN 
CO 



CO 
lO 



CN 
5 



CD 
CN 



O 
O 



O 

O 
LU 

I 

LU 
U 

or 



CD ! o 



BNSDOCID: <EP U08790A2_I. > 



105 



EP 1 108 790 A2 



5 
10 




Function 




hypothetical protein 


ATP synthase chain a (protein 6) 


H+-transpor1ing ATP synthase lipid- 
binding protein. ATP synthase C 
chane 


H+-transporting ATP synthase chain 
b 


iH+-transporting ATP synthase delta 
chain 


H+-transporting ATP synthase alpha 
chain 


H+-transporting ATP synthase 
gamma chain 


H+-transporting ATP synthase beta 
chain 


H+-transporting ATP synthase 
epsilon chain 


hypothetical protein 


hypothetical protein 


putative ATP/GTP-binding protein 


hypothetical protein 


hypothetical protein 


thioredoxin 


15 




Matched 
length 
(a.a) 




o 

CO 


m 
cn 




LO 


T 

r-- 

CN 


CO 

in 


o 

CN 
CO 


CO 
CO 


CN 
CN 


CN 
CO 


o 

CO 
CN 


m 

CD 


CO 


o 


o 

CO 


20 




Similarity 
(%) 




99.0 


56.7 


85.9 


66.9 


67.2 


88 4 


76.6 


100.0 


73.0 


67.4 


85.7 


56.0 


CO 
CO 


79.2 


71.4 






Identity 
(%) 




a 

CO 
CD 


24.1 


m 


27.8 

i 


CO 
CO 


cn 
CO 

to 


46.3 


99.8 


41.0 


38.6 


70.0 


o 

140 
T 


CO 

ih 
CO 


54.5 


CO 

co 


25 
30 


Table 1 (continued) 


Homologous gene 




Corynebacterium glutamicum 
atpl 


Escherichia co!i K12 atpB 


Streptomyces lividans atpL 


Streptomyces lividans atpF 


Streptomyces lividans atpD 


Streptomyces lividans atpA 


Streptomyces lividans atpG 


Corynebacterium glutamicum 
AS019 atpB 


Streptomyces lividans atpE 


Mycobacterium tuberculosis 
H37Rv Rv1312 


Mycobacterium tuberculosis 
H37Rv Rv1321 


Streptomyces coelicolor A3(2) 


Bacillus subtilis yqjC 


St 

o 

L> 
CD 

c: co 
x cn 
.2 co 

> 

d fx: 

JD > 

o o: 

>> CO 

2 X 


Mycobacterium tuberculosis 
H37Rv Rv1324 


35 
40 




db Match 




CM 

CO 
"^T 
O 
CD 
< 
ZD 

a 

O 


sp:ATP6 ECOLl 


_i 
cn 

h- 
OO 

1 

I 

CL 

< 

Ql 

tn 


_i 
cn 
\— 

CO 

1 

LL 

Cl 
< 

Cl 
tn 


_j 
cn 
\ — 

CO 

I 

Q 
Cl 

< 
Cl 
t/i 


i 

cn 
i- 

co 

«' 

Cl 

< 

CL 


sp:ATPG_STRLI 1 


_j 

cn 
O 

o 

I 

CO 

Cl 
\— 
< 

CL 

tn 


_j 
cn 
i— 

CO 

i 

LU 

CL 

h- 

< 
Cl 
Ui 


ZD 
h- 
CJ 
> 

CD 
>- 

CL 

tn 


ZD 
1— 
O 
> 

1 

to 

CO 

o 
> 

Cl 
tn 


in 

CO 

1 

in 

O 
to 

CN 

CJ 

CO 
CL 

o 


sp:YQJC_BACSU 


u 
> 

CD 
CN 

u 
> 

CL 

tn 


ZD 
h- 
O 
> 

I 

CN 

Q 
> 

CL 

m 






1 ll^ 

! rv CL 

i §e 


CO 
CO 

"^T 


249 


o 

CO 


O 

! "*T 
I CN 


CO 
lO 


co 
co 


1674 


m 

CD 


1449 


OJ 

CO 




O 
a> 
to 


in 

CO 

CM 


CO 

m 


CN 
CO 


CN 
CO 


45 




Terminal 
(nt) 


1271698 


1272119 


1273149 


! « 

CN 
IT) 
CO 

I r- 
CN 


1274122 


\ 1274943 


1276648 I 


1277682 


1279136 


1279522 


1280240 


1280959 


1281251 


1281262 


1282105 


I 

t — 

CO 

co 

OJ 


50 




Initial 
(nt) 


1271213 


1271871 


1272340 


1273286 


1273559 


CO 


— [ 

1274975 


1276708 


1277688 


1279151 


1279770 


1280270 


1280967 


1281714 


1281794 


1282194 






Ss « 

CO 2 * 


4R34 


4835 


CD i r-- 
co i co 

CO CO 


4838 


4839 


4840' 


4841 


4842 


4843 


4844 


4845 


4846 


4847 


CO 
OO 


l 

I 4849 


55 




LU O 2 

co 2: r 


" NT 

, r* 
i - 


|1335 


cr 

<r 
c 


1337 


1338 


1339 


1340 


1341 


CN 
N* 


CO 
CO 


CO 


m 

CO 


to 

cn 


r- 

CO 


1348 


cn 
co 



BNSDOCID: <EP 1108790A2J. > 



106 



EP 1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ay 



E 

o 
X 



-— c 
ro o 



CJ C 



§ s: § s 

2 * ! 2 o 
m 
ro 



S E 



2 E 



to 
to 

CO 



O 
O 

LD 



si 



E 



no 



O 

cj 

LLI 

I 

o 

D 
00 
00 



CO 

cn 



2g s! 

00 Z 



cn j co 



co 



O 
o 

LU 

I 

CD 
Z> 
00 
00 



hJ 

x: c 
o <i> 

CD *E 

= -O 
?» C 

ro 2* 
x: en 

f s 



E 10 

_ m 

5 > 
ro 

X3 > 

o tr 

u r^. 
>^co 
2 X 



Q ro 



O 

a 

LU 

! 

CD 

CD 
_J 

O 



o 



< 
n. 



• < -E 

: ^ a» 

i ? 2 

! .£ Cl 

I CT> 

O cz 

! -o < 



c 


c 




OJ 


o 


o 






ex 


o. 


ro 


ro 


o 


o 


a> 


ai 


«c 


XI 


C3 


o 


Q. 


o_ 


>> 


>> 


JZ 





o 

to 



in 

CO 
CD 



O 



LU 



o 
o 

LU 

I 

o 
a 

LU 



O w < I 

LU O 2 « 
! (/) 2 O . 



m I to 

CO J CO 



cn 



m 


CO 




tn 


m 


<r> 


CO 


CO 


CO 








m 


8 




m 






CO 


"1 


ro 



E r ^ 



^ g ■ 

5 > ! 5 > 

ro ^ 

o or 

>sro 
2 X 



o 
to 

CO 

o 



-O > 

o cr 
cj 

>■« ro 
^ X 



un 

CM 

o 



co 

co 



c _^ 

2 c 

SI 



ro a? 

cl ro 
ro c 
c cn 
o o 

2 ^ 

Cl 

o a> 
> -o 

To ro 
c > 
ro ^_ 

*2 

c ^3 

2 c 



£1 
O 



IS 


m 




o 


meli 


melt 


E 


um 


zob 


o 

IM 


Rh 


Rh 

r\ 1 1 



X 



CN 
<C 

cn 



X 

or 

i 

m 

x 
u. 

Cl 



T2 

E 
tn 
ro 
Cl 

CO 
CN 

cr 



E 

3 CO 

J5 cn 

o a 
^ o 

5 2 



> 
o 

CL 



X 

or 



LU 
2 



O 

o 
o 



BNSDOCID: <EP 1108790A2 t_> 



107 



EP 1 108 790 A2 



5 
10 


Function 


transcriptional regulator 


acetyltransferase 








•NJ CO 

J; ra 

3. 

E S 
o £ 
£ >> 
E £ 

>E 
a> oT 
E 5 

< 3 

2 o 

CC XI 




hypothetical protein 


tetracenomycin C resistance ana 
export protin 




DNA ligase 

(polydeoxyribonucleotide synthase 
[NAL>] 


hypothetical protein 


glutamyl-tRNA(Gln) 
amidotransferase subunit C 


glutamyl-tRNA(Gln) 
amidotransferase subunit A 


vibriobactin utilization protein / iron- 
chelator utilization protein 


hypothetical membrane protein 


pyrophosphate-fructose 6- 
phosphate 1-phosphotransrefase 




15 


Matched 
length 
(a a) 


CD 
LO 


co 








CD 
CO 




CN 
CO 
CO 


a 
o 

LO 




r~ 
CD 


o 

CN 
CN 


CD 


CO 
T 


CO 
CD 
CN 


CO 
CD 


oo 

LO 
CO 




20 


milarity 

[%) 


76.3 


55.3 








80.9 




66.0 


65.8 




70.6 


70.9 


64.0 


83.0 


54.0 


79.2 


77.9 




CO 






































i 

i 


Identity 

(%) 


m 
r- 


34.6 








to 




r*» 

CO 
CO 


30.2 




42.8 


o 
o 


53.0 


74.0 


28.1 


a> 
to 
xr 


54.8 




25 

CO 

zz 
c: 

"c 
o 

30 r 

<D 

35 
40 


Homologous gene 


£ 

CO 

ro 
Cl 

co 
cm 
Ql 

O UL 

2 E 

to 

t ^ 

'3 ™ 
o ^ 

N CJ 

£1. 


Escherichia coli K12MG1655 
yhbS 








Mycobacterium tuberculosis 1 
H37RvRv3024c [ 




Mycobacterium tuberculosis 
I H37Rv Rv3015c 


Streptomyces glaucescens tcmA 




Rhodothermus marinus dnlJ 


Mycobacterium tuberculosis 
H37Rv Rv3013 


Streptomyces coelicoior A3(2) 
gate 


! Mycobacterium tuberculosis 
H37Rv gatA 


Vibrio vulnificus viuB 


Streptomyces coelicoior A3(2) 
SCE6.24 


Amycolatopsis methanolica pfp 


! 
i 

j 


db Match 


CO 
X 

cc 

I 

u_ 

> 

Cl. 

LO 


sp:YHBS_ECOLI 




i 
i 




pir:C70858 




j CO 
! CO 
O 

r- 

! ? 

Q- 


< 
O 
tr 
i- 
co 

«' 

u 

CL 
to 




or 

O. 

X 

cc 

1 

—> 

> 

Q 

cO 


pir:H70856 


O 

CJ 
CO 

i 

u 
< 

CD 

iCL 
1/1 


sp:GATA_MYCTU 


> 

CD 

>■ 

CO 

ZD 

> 
Cl 
to 


N 
CM 

I 

CO 
LU 

o 

CO 

bl 

OT) 


UJ 

> 
< 

1 

CL 
LL 

0- 

Cl 
cn 






LO 
CN 
CN 


o 

LO 


(N 
CD 


1149 


CD 
CD 
CO 


1095 


vn 

CD 


O 
CD 
CD 


1461 


lD 
CO 
r- 


2040 


CO 
CD 
CD 


r- 

CD 
CN 


1491 


CD 
CO 


CD 
O 
CO 


i — 
; 1071 




45 


Terminal 

(nt) 


1300145 


— r 

1301055 


f 

1300988 i 


1301975 


1303694 


1304923 


1303883 


1305921 : 


1305924 


1307462 


1310369 

i 


: 1310435 


j 1311616 


1313115 


1314118 


CZ> 
TT 

CO 


1316083 




50 


Initial 
(nt) 


1300369 


1300552! 


13019291 


1303123 


1303299 , 


1303829 


1304536 


1304932 


1307384 


CD 
Ol 

. CC 

! a 
i ^ 


1308330 


1311097 


I ...... 

1311320 


' 1311625 


o 

h- 
CN 
CO 

CO 


1314775 


1315013 




CO ^ 5 


r- 
t to 


4868 


CT 
CC 
cc 


c 

T~ 
OC 


s 


CN 

r- 
■ co 

I ^ 


4873^ 


,4874 


4875 


: cc 
f 

! cc 


4877 


4878 


4879 


4880 


j co 


,4882 


4883 


55 


uj Q 2 

CO 2 C 


r 

1367 


CO 

CD 


i CT 


> . c 

> ' ^ 


> *- 

> c 


1372 


1373 


I r*- 

CO 


1375 


1? 


) r- 
r- 

■) CO 


co 
r— 
co 


CD 

r- 

CO 


1380 


CO 
CO 


1382 


1383 



108 

BNSDOCID: <EP U08790A2_L> 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



E 

c 
o 



03 



f£ 

to 



T3 



| ! 

E ! 



o 

Q. 
<D — 
co O 
CD ^= 

f§ 
t u 

ro 0> 
<u 

1-8 

— TO 
o> — 



r— 
CO 



cn' 



i t 



ro 



CL 



o 



55 



e? 15 



LU 






J 


o 


o 


< 


U 


CD 

1 


LU 


< 


1 

< 


CL 


co 


O 


CO 


o 


QC 




CL 


to 






CM 


o 


r-. 




in 



Ol 
CD 



LT> 
CO 

5 

CM 

5 



co 

1c 
o 



o 

Q. 



o 

CD 



I . 

JS! 
52 j 



So - 5 



Sofjsi 



o 
u 

LU 

I 

u 

00 
CO 

or 

Cl 



"g CD 
in £ 

I It -P 



LLI 



o 
o 

LU 

I 

ca 
co 

CD 

cn 

Ol 



cn 
uo 
CD 

5 

CM 

O 
(J 

rc 
1c 

u 

cn 

■5 9 

LU •£ 



o 
o 



CL 

o 

CL 



O 

u 

LU 

I 

Q 
co 

CO 
IX 
c 



o 
ra 

> 

a3 
o 

cn 
<D 
u 

E 

2 o 
m oj 
rr 

o o 

O /v 
TO _ 

CO > 



X3 

< 



> i2 



CO 

< 

LU 



>- 
CL 



o O 

QJ Li. 

~ o 

CO CO 



o 

GO 



O 

o 

CD 



00 

co 



or 



2 2 



CD 

a 



LU 



o 

Cl 



o 

CL 



O 

k. 
CL 

Q> 

C 
TO 

E 
<u 

E 



o 
a 



LD 

o 



n 

cr> 
TO 

XI > 
O DC 

51 



CM 

n 



CD 

co 
co 



O) 

co 



I CO 

! CO 



<7> I 



! = 



CN! 

cn j 



* - CJ> 

F in 

.?§ 

ro _ 

.O > 

o cr 

o 

co 
^ X 



|1 

o g 

<D 

= o 

o i~ 
:j < 



5 > 

-O > 
O LT 
cj 

>> n 
2 X 



BNSDTXID: <EP 1 10B790A2 l_> 



109 



EP 1 108 790 A2 



3 
LL 



10 



15 



c 

TO 

E 



20 



25 



30 



35 



40 



45 



50 



55 



<U JZ . 

to <i> ■ — ■ 



E 

CO 



T3 
<D 
3 

C 

o 
o 



J3 



CD 
GO 
3 
O 
cn 

o 

E 
o 



E 

3 

U 

E 

TO 
CD 

E > 

•i '^i 

<1) CM i 

IS i 

0) 

o h- 
U < 



3 

o 
,p 
"5 

CO 




CD 

a: 
o 
u 



c. 



QO 

5 

co 

CO 
Q_ 



c <i> 
<u c 
cn a) 
o tr> 

CD CD 



-t= TO 



TO 








c 




TO 




to 




CO 




<D 








E 




3 




E 




XJ 




TO 




U 




O 




c 




N 






c 


~m 
n 


otei 


o 




a 


ex 



O 

co 



2$ 

< c 



2§i 

co 2 ^ 



C o < 
LU § £ 

co2 g 



LU 

< 



< 



CL 
CO 

< 
< 



cr 



cn 
o 

CD 



3 

LU 

o 



Q 
O 
rsJ 
O 



>* 1 



TO 
cj 

<y 

cn in 

o ro 

CL TO 

o o 



CO -C 

a -S I 



c 

TO 

o 
a. 

JZ 



JZ 
"5 

Cl 



o 

O) 



> 



CD 



TO 




.a 




E 




o 




o. 




cn 




QJ 




o 




>■» 




E 




o 




ro 

JZ 


o 


o 




o 
ro 

in 


CD 




O 


JZ 


< 


o 


a 


CO 


CO 



o 

Q_ 
X 
(J 

co 

I 

UJ 

> 

CL 
C0 



o 

CM 
CD 



TO 

JZi CO 

o o 

-o o 
o 

-C CD 

(T CO 



cn o 

O) i CD 



CD 



CD CD 

co r> 



o 

CD 

CO 





CD 




r— 




-^r 




CO 




TO 




CL 




CD 


CD 


CO 


O 


GO 







o 
cn 



BNSDOCID: <EP 1108790A2J_> 



110 



EP 1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



rc cu — - 
2 - 



oo 



c 



03 

.a; 



is V 

t; • in c 

c - >i ? 

O) o <2 x ^ 

w c ® J a 



.5? <u 



5 o 



TOO 

7 E 

_ o 

-e of - 
Si? 

<L> Cl JZ. 
. TJ W £ 

? 0) € O) 

«/> $p 
Too© 
w x, a; 

o o 

£ CM 



c » 

v to 

c « 

Tr TO J5 

J s > 

t QJ X 

5 * -S 

CO 2 <D 



O i 



2 5 2 
ai cr <5 



E-g £ 



cm 
5 



o 
u 



LU 

O 

a 

X 



o 
o 

LU 

a' 

QJ 



45 



50 



55 



cr £■ 
O ~ 



So 

CO 7 



to 



CM 

to 

O 



cr 



m ,l < 

, Si CD 

^ 1 
CD , LU 

X > 

o ! oo 



Cvi CO 

lo a> 

O j CM 



2 O J - 



CM 



CM 



CO I to 
t- i CO 



LT> CD 

cm cm 



CO ! CT> 
CM ; CM 



I 2 



E 



o 
< 



ZD 
CO 

o 
< 

CD 

i 

o 

X 



CD 
CO 
CO 



*- OJ CO 
CO I CO I CO i 



cn j 
l » * ro 

I CD t CD j 



oo ! oo [ 



111 

BNSDOCID: <EP 1 1C8790A2 l_> 



EP1 108 790 A2 



10 



15 



20 



QJ _£T . — . 



CO 



XJ 



25 



30 



."3 



35 



o 
cn 
_o 
O 

6 



40 



45 



50 



55 



So™ 

co ^ <E 



CO | CD 

CM , CNJ 



oo 2 o 



o 

JZ 

Q. 
T3 

to 

JD ^ 
i 1/1 

Cn RJ 
- ro 

QJ 



Co 



CO 

O 



O 
CD 



CM 

a 



0) 




cn 




ro 




(1) 




IS) 




fO 




ro 




~o 




>> 




-C 




CD 
















TO 




£ 








Cl 




o 




a. 


c 


o 


ZJ 




-Q 




Z3 


CO 


CO 



o ^ 
Cl C 
O 3 
i/) 

T rj 
CO to 



>> 
o 



CD 

a 

TO 



LU < .5? 



E 

ro 

CO 



oo iS 
- to 

c £ c 

o I 



Q) 

o .9- 

cl *r 

1 1 

C ro 

_ zz 

O CD 

3 8 

zj O 

E co 



ro -9 

& 5- 

o 

_ JZ 

i 2 

x 

O Ql 



a> 

c: 
o 



^ O 

QJ 

-a 
c 

QJ 
C_ 

aj 

"9 ro m 

a 2 2 

— - "O ^ 
Q >3 

< ^ 



O 

u 

LLf 



o 
a. 

CO 



u 



el! 



t I 



a 
< 

i 

CN 

tu 



r- 

< 
CO 

I 

Q 

LU 



CO 
V) 

<J o 
aj in 
jd cn 

co 

IS 
5 > 

o nr_ 

o r^- 
co 
S X 



CD 
O 



CM 



3 
co 
a 
< 

<■ 

Q 

CL 



ro 

a 



CM 



LU T3 



o 
o 

LU 

a 
a 

CL 



CD 
O 



BNSDOCID: <EP 1 108790A2_I_> 



112 



EP 1 108 790 A2 



5 



10 



15 



20 



25 



35 



40 



45 



50 



55 



Function 




thiamin-phosphate kinase 


uracil-DNA glycosytase precursor 


hypothetical protein 


ATP-dependent DNAhelicase 


polypeptides predicted to be useful 
antigens for vaccines and 
diagnostics 


biotin carboxyl carrier protein 


methylase 


lipopolysaccharide core biosynlhesis 
protein 


1 


Neisserial polypeptides predicted to 
be useful antigens for vaccines and 
diagnostics 


ABC transporter or glutamine ABC 
transporter, ATP-bindtng protein 


nopaline transport protein 


glutamine-binding protein precursor 




hypothetical membrane protein 




phage integrase 


Matched 
length 
(aa) 




in 

CO 
CO 


m 

T 

CN 


CO 

to 
m 


CO 

cn 

CO 


CO 

o 


CO 


CD 


LO 
LO 




m 
to 


CM 
LO 

CM 


o 

CM 
CM 


CO 
CM 




CM 
CM 
CO 




CO 
CM 
<M 


Jj J 
E - 
Cn 




57.6 


59.6 


56.3 


60.0 


O 
CO 


67.2 


63.5 


78.7 




74.0 


CO 
CO 

r- 


75.0 


59.0 


CO 
CD 
CD 




52.5 


Identity 
(%) 




32.2 


38.8 


23.1 


35,4 


31.0 


38.8 


37.1 


42.6 




67.0 


56.4 I 


CM 
CO 


CM 




28.6 




26.9 


Homologous gene 




Escherichia coli K12 thiL 


Mus musculus ung 


Mycoplasma genita'tum (SGC3) 
MG369 


Escherichia coli K12 recG 


Neisseria meningitidis 


Propionibacterium freudenreichii 
subsp. Shermanii 


Escherichia coli K12 yhhF 


Escherichia ccli K12 MG1655 
kdtB 




Neisseria gonorrhoeae 


Bacillus stearothermophilus 
gfnQ 


Agrobacterium tumefaciens 
nocM 


Escherichia coli K12 MG1655 
glnH 




Methanobacterium 
thermoautotrophicum MTH465 




Bacteriophage L54a vinT 


db Match 




d 

o 

UJ 

\ 

—i 

X 

*- 

cL 

</> 


UJ 
CO 

ZD 

O 

I 

o 

13 

LL 

</) 


LU 

o 

u 
>- 
7> 
cn' 

CO 
CO 

>- 

CL 
co 


_j 
o 

o 

UJ 

1 

O 

o 

UJ 

or 

CL 
CO 


GSP:Y75303 


or 

LL 

O 
or 

CL 

o. 1 
o 
o 

00 

CL 

</> 


_j 
o 
o 

LU 

1 

LL 

X 

X 

> 

LL 

to 


— j 

o 
o 

LU 

I 

CO 
r- 

Q 

LL 

i/) 




GSP:Y75358 


i— 

CO 

o 
< 

CD 
O 

_j 
O 

bL 

in 


in 
fr- 
CC 

o 
< 

1 

u 
o 

V) 


_j 
o 
o 

UJ 

1 

I 

_j 

O 

CL 

in 




pir:H69160 




TT 
LO 
-1 

CL 
CO 

> 

id 

cn 


Li. — - 

St 


co 

CD 


ro 
cn 
cn 


CM 
CO 
r- 


1581 


2121 


CM i 
CO | 


CO 
CM 


CM 
CO 

m 


O 
CO 

•^r 


1080 


o 

CM 


O 

in 

h- 


CO | — 

rr to 
CO f eo 


o 

OO 


CO 

r- 
cn 


CO 

o 


CO 

LO 

r- 


Terminal 
(nt) 


I 1336293 


1388324 


1389073 


1390788 


1392916 


1391638 


1393151 


1393735 


CM 
CM 
T 

cn 

CO 


1395933 


1395097 


1394800 


1395568 


1396561 


1398468 


1398557 


1401333 


m 
co 

o 

o 


Initial 
(nt) 


1387270 


1387332 


1388312 


1389203 


1390796 


i 

1391951 


1392939 


1393154 i 


1393742 


m 
co 

cn 

CO 


1394894 


1395549 


1396410 


1397421 


1397662 


1399534 


co ; 

CM 

cn 
o 
o 1 


o 

T 

cn 
o 
o 


2 O « 
co ^ e 


cn 
m 
cn 


4960 


«- ; CM 
CD CO 

cn | cn 

*T | -T 


4963 


T 

ID 
cn 

*T 


m 

CO 

cn 


to j 
to 
cn | 
^ I 


' oo 
to : CO 
cn cn 
t j t 


4969 


o 
r- 

O) 


4971 


CM 

r-- 
cn 


CO il 

cn 
rr 


4974 


m | to 
cn cn 


SEQ 

NO 

/r\MA\ 


1459 


o 

CO 
T 


-1 CM 

co : co 

NT | "M 


CO 
CD 
•J 


CO 


m 1 
to | 
-r 


to 
to 


r- co 
to i CO 


cn 
to 


O I - 


» I 


1473 


"ST 


m 1 to ! 

^ , ^ ! 



BNSOOCID:<EP 1108790A2 I > 



113 



EP 1 108 790 A2 



5 
10 


Fur.ction 

i 












insertion element (IS3 related) 




hypothetical protein 








! 






i 






DNA polymerase I 


cephamycin export protein 


DNA-binding protein 


<D 

</) 

C 
OJ 
CO 

o 
-o 
n 

0) 
TJ 

CD 

0) 

c 

JC 

a 

o 
E 




15 


Matched 
length 
(aa) 












CO 
CN 




CO 




















CD 
CO 
CO 


id 
m 

rr 


CO 

co 

CM 


rr 
CO 
CM 




20 


Similarity 
(%) 












96.2 




97.0 i 




















80.8 


oo 

CO 


rr 

LO 

CO 


76.1 






Identity 
(%) 












88.5 




89.0 




















56.3 


33.8 


41.3 


46.5 


: 


25 

o 
£ 

30 

03 

JQ 

35 
40 


Homologous gene 












E 

O 

E 
ra 
3 

O) 

E 

.5 

u 
ro 
x> 
a) 

O 6 




Corynebactertum glutamicum 




















Mycobacterium tuberculosis 
polA 


Streptomyces lactamdurans 
cmcT 


Streptomyces coelicolor A3(2) 
SCJ9A.15C 


Pseudomonas putida morA j 




db Match 












pir;S60890 




PIR:S60890 




I 
















>- 

1 

o 

Q_ 
Q 

CL 
1/1 


o 
o 

1 

O 

o 

Q_ 

VI 


gp:SCJ9A_15 


sp:MORA_PSEPU 


1 




Li- 

ry CL 

o£ 


rr 
rr 


«N 

co 

rr 


O 

LO 

1 


rr 

CD 
CO 


cn 

CN 


CN 
CD 


LO 
LO 
CO 




CO 
CD 
CO 


in 

CO 


CN 
CO 


LO 

co 


CO 
rr 

cn 


CD 
O 
CO 


rr 
CO 

LO 


III 


cn 

CN 


2715 1 




1422' 


o> 

O 
CO 


CO 

h- 
CO 


cn 
m 


45 


Terminal 
(nt) 


1402076 


1402703 


1402368 


1403991 


LO 

CN 
rr 

o 

rr 


1404694 


1405320 i 


1406999 


1407167 


1407559 


1408703 


1409428 


1410064 


1411119 


r- 
co 
rr 

rr 


1412572 j 


1412626 


1416459 


1416462 


1418870 


1419748 


I co 
. h- 

; co 

• CD 


50 


to ■ 

~c 

c. — - 


1401333 


1402272 


rr 

r-~ 

CO 
CN 
O 
rr 


1403128 


1403997 


1404885 


rr 

r- 

CO 

o 

rr 


1407109 


CO 

m 

I' 


1407873 


CO 
CN 

o 
cn 
o 

rr 


1409802 


o 

rr 


rr 
CN 
rr 

rr 


1412000 


1412351 


CO 

cn 

CN 

rr 


1413745 


CO 
CO 

co 
r- 

r — 

rr 


1417962 


1418876 


1420036 




2° 3 

co ^ S 


r-- 

CD 


4978 


cn 

cn 
! ^ 


o 

CO 
Ol 
rr 


4961 


CN 

CO 

CJ> 
rr 


CO 
CO 
CO 
rr 


4984 


I ^ 
oo 
I cn 

' rr 


4986 


4987 


4988 


4989 


4990 


4991 


4992 


4993 


rr 
cn 

cn 

rr 


in 

CO 
CO 


4996 


r-- 
cn 
cn 

rr 


4998 


55 


SEO 
NO. 
(DNA) 


r-~ 
r»- 

i rr 


CD 

rr 


1479 


I CO 

CO 

rr 


l = 

= 


CM 
CO 
rr 


CO 
CO 

rr 


rr 
CO 

rr 


. m 

CO 

rr 


CO 
CO 
rr 


co 

rr 


1488 


cr> 
co 

rj 


o 
o> 

rr 


1491 


CN 

Oi 
rr 


ro 

cn 
rr 


rr 
CD 
rr 


in 
o 
rr 


1496 


1497 


1498 



114 

BNSDOCID: <EP 1 108790A2J. > 



EP1 108 790 A2 



5 
10 


Function 


hypothetical protein 


30S ribosomal protein S1 




hypothetical protein 










inosine-uridine preferring nucleoside 
hypolase (purine nucleosidase) 


aniseptic resistance protein 


nbose kinase 


cnptic asc operon repressor, 
ranscription regulator j 




excinuclease ABC subunit B 


hypothetical protein 


hypothetical protein 


hypothetical protein 




hypothetical protein 


hypothetical protein 


hydrolase 


15 


Matched 
length 
(a.a) 


CO 
CD 


m 
*r 




m 

CD 










c 

CO 


in 


CO 

cn 
OJ 


CO 
CO 




CD 


OJ 

in 


OJ 


cn 

OJ 




oo 

CO 

cn 


o 
in 


OJ 


20 


Similarity 
(%) 


CO 

CO 
LO 


71.4 




93.9 










o 

CO 


53.8 


67.6 , 


65.6 




33.3 


59.2 


80.2 


77, 




47.2 


68.0 


58.4" 




Identity 

(%) 


31.9 


39.5 




80.5 










61.9 


23.6 


35.5 j 


30.0 




57.4 


33.6 


38.8 


53.8 




23.2 


OJ 

CO 


30.4 


25 

QJ 
ZZ 

o 

30 

CD 
JD 

35 
40 


Homologous gene 


Streptomyces coelicolor 
SCH5.13yafE 


Escherichia coli K12 rpsA 




Brevibacterium lactofermentum 1 
ATCC 13859 yacE 










Crithidia tasciculata lunH 


Staphylococcus aureus 


Escherichia coli K12 rbsK 


Escherichia coli K12 ascG 




Streptococcus pneumoniae 
plasmid pSB470 uvrB 


Methanococcus jannaschii 
MJ0531 


Escherichia coli K12ytfH 


Escherichia coli K12ytfG 




Bacillus subtilis yvgS 


Streptomyces coelicolor A3(2) 
SC9H11.26c 


Escherichia coli K12ycbL 


db Match 


o 
o 

UJ 

1 

LU 

< 
> 
rx 

LO 


_j 
o 
o 

LU 

i 

CO 

or 

CL 

m 




LU 
OT 
CD 

1 

UJ 

o 
< 
> 

id. 
tn 










< 
u_ 

or 
o 

i 

x 

-) 

Dl 
tn 


3 

<' 

< 

O 

Q_ 

tn 


-j 
O 

O 
LU 

I 

CO 
CD 

or 

CL 

in 


_i 
o 

CJ 
LU 

o 

CO 

< 
tn 




a. 
or 
i— 

CO 

1 

CO 

or 

> 

Qu 
tn 


< 

~o 

h- 

UJ 

1 

CO 

in 
>- 

CL 

in 


i 

O 
o 

UJ 

X 

LL 
h- 

> 

CL 
lO 


_j 
o 
o 

UJ 

) 

O 

LL 

Cl 
V) 




o 

o 
o 

X 
c. 


CD 
OJ 

1 

1 

CO 
CL 

cn 


sp:YCBL_ECOLI 




8 1 


TT 

LO 
CD 


co 

uO 


CD 

r-- 


O 
O 
CD 


1090 


OJ 
CO 
LO 


CD 
OJ 


LO 

cn 


CD 

ro 

o> 


CD 


OJ 

cn 


1038 


co 

CD 


2097 




CO 
CO 


CD 
NT 
CD 


%r 
co 
CO 


oo 
co 

CM 


OJ 
CD 


o 
o 

CD 


45 


Terminal 
(nt) 


r- 
o 
o 

OJ 

■VI 


CD 

m 
in 

CN! 
1 ^ 

vr 


CD 

cd 

O 
OJ 

NT 


1425878 


«T 

LO 

co 

OJ 


CD 

r— 
CO 

r~- 

OJ 


1427804 


CD 

OJ 
m 
OJ 


1428224 


-cr 

CD 

CD 
OJ 
*t 


1430659 


1431575 


1433547 


1436201 


1436775 


1436869 


1438201 


1440026 


1438212 | 


1440675 


r> 

CD 

r- 

*T 
T 


50 


Initial 
(nt) 


i s 


cn 

CD 

o 

cm 


r-w 
m 

OJ 


! s 

I £ 
i £ 


r*- 
m 
o» 

CD 
OJ 


h- 

lT> 
OO 

OJ 

*y 


1428049 


o 
o> 

OJ 

oo 

OJ 

•M 


1429159; 


cm 

lC 
O 
CO 
T 


CD 

r- 
in 

co 


OJ 

to 

OJ 
CO 
XT 


o 
m 

OJ 

CO 


1 434 105 j 


m 

CO 
CO 
CD 
CO 


CD 
TT 
OJ 
h-. 
CO 


CD 

in 

CO 

r-- 
co 

"ST 


1439343 


1440560 


CD 
CO 

in 
-*r 


CVJ 
CD 
CO 
CM 
T 




! So - 


4999 


5003 


5031 


5002 


COOS 


"CT 

o 
tn 


in 
o 
o 
m 


5006 
5007 


5008 


600S 


5010 


r- 

5 

m 


5012 


5013 


5014 


5015 


5016 


5017 


5018 


5019 


55 


uj y n 


1 o> 
i o> 

i 5 


1500 


1501 


1502 


1503 


1504 


1505 


1506 


1507 


CO 

o 

lO 


1509 


1510 


1511 


1512 


CO 
LO 


5 


1515 


1516 | 


1517 I 


1518 j 


1519 | 



BNSQOCID: <EP 11 08790 A2. 1 > 



115 



EP 1 108 790 A2 



5 
10 




Function 


excinuclease ABC subunK A 


hypothetical protein 1246 (uvrA 
region) 


hypothetical protein 1245 (uvrA 
region) 






translation initiation factor IF-3 


50S ribosomal protein L35 


50S ribosomal protein L20 

i 

1 






sn-glycerol-3-phosphate transport 
system permease protein 


sn-glycerol-3-phosphate transport 
system protein 


sn-glycerol-3-phosphate transport 
system permease proein 


sn-glycerol-3-phosphate transport 
ATP-binding protein 


hypothetical protein 


glycerophosphoryl diester 
phosphodiesterase 


0 

qj tn 
c fo 
'tr cj 

|1 
< 3c 

£ £ 


phenyla/anyMRNA synthetase alpha 
chain 




15 




Matched 
length 
(aa) 


CM 

in 
cn 


100 


CN 
T 






cn 


0 
to 


*— 






CN 

cn 

CM 


0 

r- 
cm 


CO 
CO 


ro 
cn 
ro 




CM 


ro 
uo 


















































20 




is 5 

CO 


80.6 


57.0 


47.0 






78.2 


76.7 


92.7 






71.6 


70.4 


57.6 


71.3 


56.0 


50.0 


71.2 










Identity 

(%) 


56.2 


40.0 


31.0 






52.5 


41.7 


75.0 






33.2 


33.3 


26.6 


0 

ST 


0 


26.2 


I 34.0 






25 
30 
35 
40 


Table 1 (continued) 


Homologous gene 


Escherichia coli K12 uvrA 


Micrococcus luteus 


Micrococcus luteus 






Rnodobacter sphaeroides intC 


Mycoplasma fermentans 


Pseudomonas syringae pv. 
syringae 






Escherichia coli K12 MG1655 
ugpA 


Escherichia coli K12 MG1655 
upgE 


I Escherichia coli K12 MG1655 
ugpB 


Escherichia coli K12 MG1655 
ugpC 


Aeropyrum pernix K1 APE0042 


Bacillus subtilis glpQ 


Escherichia coli K12 MG1655 
trmH 


Bacillus subtilis 168 syfA 






db Match 


sp:UVRA ECOLI 


PIR JQ0406 


PIR JQ0406 






sp IF3 RHOSH 


! uj 

1 LL 
O 

! ^ 

1 ^ 

ro 
_l 

1 ^ 

Ql 
in 


sp:RL20_PSESY 






_j 

O 
O 

LU 

<' 

Q_ 
O 

P. 
ci 


sp:UGPE_ECOLI 


— 1 
O 
0 

UJ 

1 

cn 
o_ 
CD 

CL 

tf> 


_) 
O 

u 

LU 

I 

0 

Q_ 
O 

Ol 
CO 


CO 
in 

CN 

LU 

£T 
a 


z> 

CO 

0 
< 

CD 

\ 

O 
CL 
-J 

O 

CL 
1A 


_j 
O 
0 

LU 

x' 

rx 
i— 

CL 
in 


sp:SYFA_BACSU 








LL — 


1 s - 

co 
CM 


ID 
O 

m 


0 

to 


r- 


2124 


tO 

m 


CN 


CO 
CO 


CN 
CN 

co 


K- 
UD 

m 


\ co 
: O 
cn 


CO 
CO 


CO 


CM 
CM 


o> 

T 

CM 




T 

cn 
m 


1020 




45 




Terminal 
(nt) 


1445333 


O 

CO 

co 


T 

cn 
-c 


1445874 


1445323 


1448358 


1448581 


1449025 


1449119 


1450692 


1451820 


1452653 


1454071 


1455338 


1454102 


1 1455350 


1456948 


1458066 




50 




Initial 
(nt) 


1442487 


1444115 


1445393 


1446158 


1447446 


1447792 


0 
oi 
ro 

CO 


1448645 


1449940 I 


1450125 


1450918 


1451320 


1452758 


1454115 


1454350 


1456036 


1456355 


{ 1457047 


i 






CO ^ 2 


5020 
5021 


5022 


S023 


T 

CM 
O 

in 


in 

CN 

a 
m 


CD 

CN 

a 

UP 


5027 J 


5fl?R 


cn 

IN 
O 

ir 


5C30 


5031 


5032 


5033 


5034 


5035 


5036 


5037 


55 




I 


* o| - 

CN 1 CM 

t to j m 


CM 
CM 

tn 


cn 

CN 

IT 


Cn 

cr 


tr 

CN 

; tr 
I 


to 

CM 1 CN 

i in I in 

1 


i °c 

Cn 

tr 


cm 

CN 

tr 


1530 


CO 

tn 


cn: 
CO 

m 


1 CO 
1 CO 

m 


i ^ 
I "~ 


1535 


1535 


1537 


! 

j 
1 



116 

BNSDOCID: <EP 1108790A2_I_> 



EP 1 108 790 A2 



5 
10 




Function 


phenylalanyl-tRNA synthetase beta 
chain 




esterase i 


macrolide 3-O-acyltransferase 




N-acetylglutamate-5-semialdehyde 
dehydrogenase 


glutamate N-acetyltransferase 


acelylornithine aminotransferase 


argininosuccinate synthetase 




argininosuccinate lyase 








hypothetical protein j 


tyrosyl-tRNA synthase (tyrosine- 
tRNA ligase) 


hypothetical protein 




hypothetical protein 


15 




Matched 
length 
(aa) 


ro 

CO 




CO 

to 

CO 


CO 
Oi 
TT 




CO 


CO 
CO 
CO 


en 

CO 


o 

TT 




CO 








o 
to 




I 




Oi 






Similarity 
(%) 


r- 






co 








OJ 


in 




o 








o 


to 


T 




O 


20 








tri 
to 


to 
to 




CD 

CD 


CD 
CD 


CD 
CD 


CD 
CD 




o 

CD 








oi 


CD 

r>» 


to 




lO 

r— 








C£> 




tr> 


o 




CO 


to 


O 


to 




CO 








O 




CD 




a 






is 

TO 


oi 




to 

OJ 


d 

CO 




co 

CD 


CD 
cn 


CD 
CD 


CD 

cn 




CO 
co 








CO 


CO 


to 

OJ 






25 












































30 
35 


Table 1 (continued) 


Homologous gene 


Escherichia coli K12 MG1655 
syfB 




Sireptomyces scabies estA 


Sireptoinyces mycarotaciens 


1 


Corynebacierium glutamicum 
AS019argC 


Corynebacterium glutamicum 
ATCC 13032 argJ 


Corynebactenum glutamicum 
ATCC 13032 argD 


Corynebacterium glutamicum 
AS019 argG 




Corynebacterium glutamicum 
AS019argH 








Escherichia coli K12 ycaR 


Bacillus subtilis syyl 


Methanococcus jannaschii 
MJ0531 




Chlamydia muridarum Nigg 
TC0129 


40 




jr 
a 

03 

n 

TJ 


Zj 

o 

u 

LU 

1 

CO 

Ll_ 

> 

CO 




o 

CO 

or 

h— 

CO 

<■ 

CO 


or 
j- 

00 

1 

CD 

:> 

Q 




i 

Oi 

CM 
lO 

o 
o 

< 


_j 
O 
or 

o 
o 

i 

—3 

u 
or 
< 


ARGD.CORGL 


—i 
O 
or 
o 
o 

i 

> 

CO 
CO 

< 




AF048764_1 








_j 

o 
o 

LU 

1 

or 
< 

> 


SYY1_BACSU 


< 

~3 

h- 
LU 

1 

fO 

m 
>- 




co 

CO 
LL 

or 

0L 








Cl 
in 




s 


CL 




cr. 


a. 
in 


«rt 


cL 
in 




o. 
cn 








<a 


CL 
i/t 


Cl 
tn 








u 


TT 

co 

OJ 


r*- 


j OJ 

r-- 
cn 


1383 


OJ 

o 
"<T 


1041 


1164 


1173 


CO 

o 

OJ 


1209 


1431 


co 


1575 


OJ 
5 




1260 


m 

to 

*T 


o 

CD 

co 


I - 

1 *~ 


45 




Terminal 
(nt) 


1460516 


i 

CD I CO 
CD O.' 

CO OJ 

in i to 
rr J 


1453516 j 


1453934 


1465123 


1466373 


CO 
T 

to 

CO 

to 

TT 


CO 

r- 


1470154 


r-- 
O 

CD 
OJ 


CTi 

r- 


1475693 


CD 
OJ 

to 
r- 
rr 


CD 

m 
to 
r-» 
t 


1477809 


rM 

CD 
1 — 

*x 


1478503 


! - 

! co 

CO 
CO 
CO 


50 




Initial 
(nt) 


1458133 , 


1458966 


i 

ll 


*e 

CO 
OJ 

to 
*T 


1463533 


1464083 


o 

OJ 

to 
to 
TJ 


to 

CO 

r- 
tn 

TT 


1470211 1 


1471362 


1471477 


1472977 


Ol 
*T 

rr 


1475683 


• to 

1' 


1476550 j 


CO 
CD 
CJ 

oo 
r»- 
rr 


OJ 
CD 
CO 
CO 

r^. 


in""! 

C) 
CO 

^ 1 

! 






CO 2 2. 


5038 


5039 I 


o 

v: 
' O 

in 


VJ 

o 


OJ 

o 

i tn 


; co 
! S 

i 


vr 

• O 
1 lO 


5045 j 


i £ 
o 
in 


5047 


5048 


5049 


5050 


m 
O 
to 


5052 


5053 


tn 
o 
m 


5055 


5056 


55 




co ^ n 


1538 


CD 

ro 
tn 


o 
rr 
m 


\ * 

1 uo 


OJ 
uO 


| 2 

1 5 


1 — 

1 s 


to 
to 


to 

12 


r>- 

lO 


CO 

1 2 


CD 

to 


o 
in 
to 


to 
to 


1 OJ 
■ to 

; to 

1 ■ 


1553 


tn 
tn 


in 

tn 
in 


1556 



BNSDOCID: <EP 11 08790 A2 l_> 



117 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




BNSDOCID: <EP 1 108790A2_L> 



118 



EP1 108 790 A2 



5 
10 


Function 


cytidylate kinase 


GTP binding protein 






cu 
t/i 
to 
cu 

c. 

JZ 
QJ 

£ 


ABC transporter 


ABC transporter 


I 

1 


(hypothetical membrane protein 


! 


5 

c 
o. 

c 
ro 

+ 

X 

4- 

co 






hypothetical protein 


CJ 

ro 
o 
c 
cu 

X3 
<f 

cm" 
ro 

H. 
<u 

JZ 

o 

X 

o «J 
T3 O 

*i 

CM JZ 


preprotein translocase SecA subunit 


signal transduction protein 


hypothetical protein 


hypothetical protein 


15 


Matched 
length 
(a a) 


o 

CM 
CM 


LO 
CO 
XT 






CM 
CO 
CM 


cn 
on 

rr 


CM 

o 

CO 




257 




CD 

on 

XT 






o 

CO 


o 

CM 


in 
o 

CO 


CM 

ro 


"XT 

ro 

CM 


CO 

ro 












































20 


Sirnilari 
(%) 


<C 
ro 
f^- 


o 

XT 

r*- 






67.2 


o 

CO 


co 

CO 

cn 




73.2 




1 61.5 






57.7 


63.8 


61.7 


93.2 


74.4 


63.2 




Identity 
(%) 


38.6 


42.8 






36.2 


29.7 


| 31.2 : 




39.7 




25.7 






36.9 


25.2 


35.2 


75.8 


41.9 


30.8 


25 

CD 

o 

30 ^ 

0) 
.Q 


Homologous gene 


Bacillus subtilis cmk 


Bacillus subtilis yphC 






Mycobacterium tuberculosis 
Rv3342 


Corynebacterium striatum M82B 
tetA 


Corynebacterium striatum M82B 
tetB 




Escherichia coli K12 ygiE 




Bacillus subtilis ATCC 9372 
nhaG 






Escherichia ccli K12o249#9 
ychJ 


Archaeoglobus fulgidus AF0675 


Bacillus subtilis secA 


Mycobacterium smegmatis garA 


Mycobacterium tuberculosis 
H37Rv Rv1828 


Mycobacterium tuberculosis 
H37Rv Rv1828 


40 


db Match 


sp;KCY_BACSU 


tn 
O 
< 

CD 

J 

X 

CL 






3 
f— 

CJ 

> 

i 

CM 

X 
> 
id. 

tn 


CQ 
CM 
O 
CO 
CO 

In 

CM 
t 


prf25l3302A 




sp;YGIE_ECOLl 




LO 
LO 
LO 

cn 

CM 

o 
OD 
< 
Cl 
rj) 






_j 
o 
o 

LU 

X 

o 
> 

CL 


rr 
CO 
CO 

cn 
co 
O 

CL 


3 
co 
O 
< 
CO 

«' 

o 

LU 
CO 

Cl 
to 


CM 

1 

XT 

XT 

co 
ro 
r*- 

LL 

< 
Cl 
cn 


3 

r- 

O 

>- 

t 

LL 

Q 
o 

>- 

CL 
eft 


3 
1— 
O 
>- 

I 

LU 
O 
o 

O- 

</> j 




o ~ 


o 

CD 
CO 


1557 


to 
to 

CO 


CO 

cn 

T 


ro 
CO 


%* 

LO 

m 


1767 


m 

CO 


CD 
CO 


i 

CD 
00 


CO 

LO 


to 

CO 


o 

CM 
XT 


in 
ro 


XT 
CO 


2289 


oil 

CM 
XT 


CO 
LO 

rv. 


ro 1 

CO 
CD 


45 


Terminal 


CO 

cn 

TT 

O 

uo 


| 1506573 


| 1506662 


; 1507405 


1507917 I 


1510366 


1512132 


151C843 


1512977 


co 
cn 
to 

in 


1512980 


XT 

r- 

f 

to 


1515815 


1515408 


1515799 


CO 

m 
o> 
m 


1520029 


m 

XT 

cn 
o 

CM 

m 


1521589 


50 

I 


CO -^y 

C 


CO 

tn 

CM 
XT 

o 
in 


1505017 


j 1507327 


1507902 


1508729 


1508813 


I 1510366 


1511667 


1512189 


1514505 


1514527 


1515159) 


1515396 


1515782! 


1516962 


1517170 


1519601 


o 
on 

o 

CM 

in 


152CS57 j 


i 


O ^ 


5076 


5077 


5078 


5079 


5080 


5081 


CM 
CO 

O 

lTj 


5083 


XT ! 

CO j 

o 
in J 


5085 


5086 


i 5087 


5088 


5089 


0605 


5091 


5092 I 


5093 


5094 


55 

■ 


UJ ^ £ 

CO ^ Q 


1576 


in 


1578 


cn j 

m j 


1580 


1581 


CM 
CO 

m 


ro 
co 
m 


T 
CO 

m 


in 

CO 

in 


1586 


S| 


CO 
CO 
LO 


cn 

CO 

-~o 


o ! 

5 1 

5 


1591 


CM 
O) 

LO 


1593 


XT 

cn 
in 



BNSDOCID/. <EP 1108790A2 I > 



119 



EP1 108 790 A2 



10 



15 



ra O - — ' 



20 



CO 



o 
a. 



25 



3D 



D 

n 
c 

c 

o 

o 



Q) 



O 

E 

o 



35 



40 



5 > 

" IT. 
ro 

-D > 

o cr 

a 

^ X 



O 
>- 

I 

LU 

O 
o 



45 



si 



50 



55 



o 


cn 


cn 


LO 


to 


CD 


CD 




co 





o2 5 Cn 



mi J 2 , 

S z 9 i 



CL 




T5 










:>> 


in 








CJ 




m 


to 


</) 


cO 


3 




O 


O 


ro 


ro 


CD 


CD 



o 

Q. 

cn 

c 

T3 
C 

d 



a 



GO 

a 
< 

CD 
CL 

Q _ 
X X 
>- > 

Q_ CL 



CD ^! 



CD 
CD 



I s ? 

.5 co 

5 > 
^ a 

_n > 

o er 
o 

S X 



CJ) I O} I CO 

ld co j co { co 



cn VJ 



u 
> 

CO 1 

Q 

> 

CL 



CO 




CD 




o 




C3 




u 




O 




.a 












E 






NT 
CO 




> 




tr 


ro 


-Q 


> 


O 


cc 


u 


r«- 


co 


2 


X 



JO 

E 



CN 

ro 

CN 



c 






<D 




<1> 


to 




to 


>N 




>* 


</> 




to 


t 






o 




o 


Cl 




CL 


in 




in 






c 


ro 


c 


TO 




a 






o 


to 


<D 




a> 


ro 


CL 


ro 


c 


CJ 


c 


o 


to 


o 


-C 


ro 


J= 


CL 


o 


CL 


to 


E 


i/l 


O 




o 




c> 




CL 


CL 


Cl 



o 

CO 
CO 



CO 
CD 

o 
O 



— o 

P CO 

§ 00 

CO 

s > 

<=> or 
ro 

o > 

o cr 

o 

>vCO 

5 X 



o 



-cL-E 

(O « 

5 s 

CL Cl 



O j T- , 

o o 

(O CD . 



CO 

o 

CO J CO 



LU 



CO 
LO 



CO 
LO 



X 

CL 



O 
LO 
CM 



CN 



CL 
CM 



O 



CN 

5: 



O 

CJ 
LU 



X 
CL 

CL 



O 

a 

LU 

I 

o 

X 

CL 

Cl 



O 
CO 



o 
co 



CN 
CO 
CO 
LO 
CO 



O) 

o 

CO 



CM 
CM 
CO 
CO 
CO 



CO 

CO 



CO 
CD 
CO 
co 
CO 
I cO 



: co 



CO 

T 



BNSDOCID: <EP 1108790A2_L> 



120 



EP 1 108 790 A2 



5 






ne kinase 


ase 


-phospholipid 


o 

OJ 

ro 

II 


ransferase 




ocation pump 




















inose 


1 

apolipoprotein N-acyltransferase ] 






10 


Function 




phosphomethylpyrimidi 


hydoxyethylthiazole kin 


cyclopropane-fatty-acyl 
synthase 


sugar transporter or 4-r 
pnthalate/phthalate per 


purine phosphoribosylti 


hypothetical p'otein 


arsenic oxyanton-transl 
membrane subunit 




hypothetical protein 


sulfate permease 


hypothetical protein 










hypothetical protein 


dolichol phosphate mar 
synthase 




secretory lipase 


15 


Matched 
length 
(aa) 




CN 
CO 
CN 


o 

T 
CN 


m 


CO 
CO 
vT 


CO 

m 


CO 

o 

CN 


CO 

CO 




CM 
CN 
CN 


CO 
CO 


o> 










o 


CM 


r«- 
CN 
LO 




CN 

cn 

CO 


20 


Similarity 
{%) 




70.2 


77.5 


55.0 


66.9 


59.0 


68.5 


54.6 




83.8 


83.6 


50.0 










87.3 


71.0 


55.6 




55.6 




Identity 
(%) 




47.3 ! 


46.6 


CO 
CO 
CN 


m 

CN 
ro 


36.5 


39.8 1 


23.3 




62.2 | 


51.8 


; 39.0 










71.8 


39.2 


25.1 




r- 
co 

CN 


Table 1 (continued) 


Homologous gene 




Salmonella typhimurium thiD 


Salmonella typhimurium LT2 
thiM 


Mycobacterium tuberculosis 
H37Rv ufaA1 


Burkholderia cepacia Pc701 
mopB 


Thermus flavus AT-62 gpt 


Escherichia coli K12 yebN 


Sinorhizobium sp. As4 arsB 




Streptomyces coelicolor A3(2) 
SCI7.33 


Pseudomonas sp. R9 ORFA 


Pseudomonas sp. R9 ORFG | 










Mycobacterium tuberculosis 
H37Rv Rv2050 


Schizosaccharomyces pombe 
dpml 


Escherichia coli K12lnt 




Candida albicans Iip1 


40 


db Watch 




>■ 
p- 
_j 

< 

CO 

i 

Q 

P 

d. 
cn 


>- 
h- 
_j 

< 
co 

i 

X 

CL 


o 

CO 
00 

o 
j_ 

Cl. 


CD 

cn 
co 

CO 
CO 
CN 
CN 
CN 

5. 


m 

CN 

in 

CO 

o 

(N 
CN 

cL 


i 

o 

CJ 
LU 

i 

z 

CD 
LU 

>- 

CL 
</> 


CN 

1 

CD 

in 
r- 
co 
r- 

< 
CL 

cn 




CO 
CO 

1 

O 
CO 

CL 

cn 


gp:PSTRTETC1_6 


GP.PSTRTETCW 










in 

T 
CO 

o 
< 

cL 


< 

CO 
CD 
T 

CO 
CN 

Cl 


__j 
0 
a 

Li. 

t— ' 

_ j 

CL 
</» 




gp:AF188894_1 




ii 


CM 
O 


1584 


O 

co 


1314 


1386 


r- 


cn 

O) 
CO 


CO 

cn 


CO 
CO 


co 
o 

CO 


in 
in 


CO 

CN 
vT 


m 
to 


p- 
o 

CN 


CD 
CO 


o 
m 


CO 

cn 

CO 


o 

CO 


1635 


r«- 


1224 1 


45 


Terminal 
(nt) 


1538963 


1539820 


1542119 


cn 
co 

CN 
CO 

%r 
in 


1546307 


1547967 


CD 
N 
CO 

cn 

TT 

m 


1550398 


1550951 


1552237 


1553972 


1553297 


1554070 


1555067 


1554891 


| 1555086 


1556771 


1557014 


1557859 


r>- 

CD 

-s- 

CD 

in 
in 


1560437 


50 


Initial 1 
(nt) 


rr 
CD 
CO 

cn 

ro 

in 


CO J CN 
O | CN 
T O) 
— CN 

uo tn 


CO 

Oj 
T 

tn 


1547692 


o 

CO 

n 
m 


m 
to 

CO 
*T 

in 


1549403 


1550469 ! 


1551545 


i 1 55251 8 | 


1553722 


XT 
00 
CO 

TT 

m 
m 


1554861 


1555079 


1555835 


1556376 


1557823 


1559493 


r- 
co 

CN 

C 

CO 

in 


1561660, 




SEQ 
NO 
(a. a ) 


5116 


5117 


CO 

m 


5119 


5120 


5121 


5122 


5123 


CM 

m 


in 

CM 

vn 


5126 


5127 


5128| 


5129 


o 

CO 

m 


5131 j 


a 

in 


'5133 1 


ro 

LO 


5135 


5136 


55 


LJ V ^ 


CO 
CO 


to 


CO 


1619 


1620 


CN 
CO 


CN 
CM 
CO 

- 


CO 

s 

L 


T 
CM 
CO 


uO 
CN 
CD 


llC26 


r»- 

CN 


CD 
CN 
CO 


o> 

CN 
CO 


o 

CO 
CO 


1631 


j 1632 


CO 

ro 
CO 


ro 
CO 


1635 


CO 
ro 
CO 



121 

BNSDOCID: <EP. 1108790A2 I > 



EP 1 108 790 A2 



5 
10 




Function 


precorrin 2 melhyltransferase 


n 

- tn 
C) ro 

> W 
CD tZ 

c 

o 1- 

o i= 
<u a) 

a. E 


oxidoreductase 


dipeptidase or X-Pro dipeptidase 


ATP-dependent RNA helicase 


sec-independent protein transiocase 
protein _ 


hypothetical protein 


hypothetical protein 


hypothetical protein 


hypothetical protein 


hypothetical protein 


hypothetical protein 
hypothetical protein 


15 




Matched 

length 
(a a) 


CM 


T 


CM 


CM 
CO 
CO 


1030 


CD 
CD 
CM 


to 
CO 


I 

CO 


CM 
CO 


r- 
CD 


CO 


516 
j 159 


20 




Similarity 


56.7 I 


00 

o 

CD 


75.4 


61.3 


LO 

in 


r- 

CN 
CO 


69.4 


61.2 


64.8 


77.3 


80.3 


74.2 
50.0 






Identity 

(%) 


CO 

R 


32.4 


54.1 


36.1 


26.5 


28.7 




31.9 


32.4 


53.1 


m 


<o o 

CO CM 


25 
30 
35 
40 


Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37Rv cobG 


ul 
C 

<o 

(J 

T3 

*/> 

S o 

Q_ CO 


Mycobacterium tuberculosis 
H37Rv RV3412 


Streptococcus mutans LT1 1 
pepQ 


Saccharomyces cerevisiae 
YJL050W dob1 


Escherichia coli K12 tatC 


Mycobacterium leprae 
MLCB2533.27 


Mycobacterium tuberculosis 
H37Rv Rv2095c 


Mycobacterium leprae 
MLCB2533.25 


Mycobacterium tuberculosis 
H37Rv Rv2097c 


Mycobacterium tuberculosis 
H37Rv Rv2111c 


Mycobacterium tuberculosis 
H37RvRv2112c 

Aeropyrum pernix K1 APE2014 




db Match 


pir:C70764 


LU 

Q 

UJ 
CO 
Q_ 

1 

_J 

m 

O 

o 

Cl 

V) 


ZD 
\~ 

o 
> 

CN 
> 

CL 
tn 


i 

o 

CD 
T 
O 

< 
O- 


CO 

< 

UJ 

> 

1 

ot 

r- 
CX 


sp.TATC_ECOLI ! 


UJ 

_j 
o 
> 

1 

N" 
CO 

t 

1 </» 


ZD 
t- 
O 
> 

1 

m 

CO 

> 

Id- 
Ol 


UJ 

_j 
o 
>- 

CD 
CO 
> 

> 

t#i 


sp:YY37_MYCTU 


pir;B70512 


CM O 
LO 

m cn 

O r- 

y t± 






&f 

o ~ 


r-- 
r- 


1278 
366 


to °o 

%T CO 


1137 


639 


I 

1002 


LO 

t CO 


CO 
CD 


CM 

r- 

CD 


1425 


CD CM 
T CD 
CM t- 


1542 
480 


45 




Terminal 

(nt) 


1562553 j 


1562525 j 
1564237 


1564482 
1564565 


1565302 


to f — 

o *- 

U"> j LO 


1569932 


| ! 

1571068 


1571506 


1572492 


1573491 


1575205 
1574945 


1575406 
j 1577806 


50 




Initial 

(nt) 


1561780 


1563802 


1564237 
1565302 


CO 

ro I 
T 

CD . 

CO 

CO 


CO ! CO 

co ' o 
rr CD 
CD CD 
CO CD 

m lo 


1570933 


1571382 


1572486 


1573463 


1574915 


co 

LO ! ro 
CD 

LO 

in lo 


1576947 
1577327 






j co 2 r 


-| £ 

j ro 

i £ 


co a 
co r 

LO u 


> o I «- 
i "-r ! 

") LO j U"; 


5142 


5143 
5144 


! f 

to 


CO 
T 

LO 


r- 
in 


5148 


! o 

! - 

I IO 


5150 

t>1 0 1 


5152 
5153 


55 




!soj 

I CO ^ C 


1637 


1638 


i 

"> "<r t 

D ! CD CD 


1642 


1643 
1644 


| 1645 


CO 
CD 


1647 


! « 

CO 


1649 


o 

LO LO 
CO CO 


1652 
1653 



BNSDOCID: <EP 1 108790A2_I_> 



122 



EP1 108 790 A2 



5 



10 



15 



20 



25 



O 
o 

30 : 



35 



40 



50 



55 



Function j 


AAA family ATPase (chapefone-like 
function) 


protein-beta-aspartate 
methyltransferase 


aspartyl aminopeptidase 


hypothetical protein 


virulence-associated protein 


quinolon resistance protein 


aspartate ammonia-tyase 


ATP phosphonbosyltransferase 


bela-phosphoglucomutase 


' 5-methyltetrahydrofolate- 

| homocysteine methyltransferase 




alkyl hydroperoxide reductase 
subunit F 


arsenical-resistance protein 


arsenate reductase 


a-senate reductase 




cysteinyi-tRNA synthetase 


Matched 
length 
(aa) 


ID 
ID 


CO 
CM 


to 

ro 


cn 

ID 

CN 


cn 

CO 


m 

CO 
CO 


o 

CN 

in 


CO 
CN 


in 
cn 


1254 




CD 
CD 
CO 


co 

CO 
CO 


cn 
fN 


CO 
CN 




CO 
CO 


Similarity 
(%) 


78.5 


O 

oS 


67.2 


71.4 


1 72.5 


61.0 


CO 

cn 
cn 


97.5 


63.1 


62.4 




49.5 


cn 

CO 

to 


CO 

*T 

CO 


75.6 




64.3 


Identity 
(%) 


51.6 


57.3 


38.1 


in 


CO 
d 


21.8 


CO 

cn 
cn 


96.8 


30.8 


31.6 




CN 
CN 


o 
co 

CO 


32.6 


CN 




35.9 


<D 
CL 
<D 
CO 
t/i 
Z3 

o 
cn 
o 
o 

5 

X 


Rhodococcus crythropol;s arc 


Mycobacterium leprae pimT 


Homo sapiens 


Mycobactenum tuberculosis 
H37Rv Rv2119 


Dichelobacter nodosus A198 
vapl 


Staphylococcus aurejs norA23 


Corynebacterium glutaHcum 
(Brevibscterkim flavum) MJ233 
aspA 


Corynebacterium glutamicum 
AS019hisG 


Thermotoga maritima MSB8 i 
TM1254 


Escherichia coli K12 metH 




Xanthomonas campestris ahpF 


Saccharomyces cerevisiae 
S288C YPR201Wacr3 


Staphylococcus aureus plasmid 
pl258 arsC 


Mycobacterium tuberculosis 
H37Rv arsC 




Escherichia coli K12 cysS 


db Match 


o 

CM 
CO 
CO 
CN 
INi 

T 

Cm 

CL. 


T 

CO 
CN 

CO 
Q_ 


*~i 
o 
m 
o 
in 
o 
o 

LL 
< 
CL 

cn 


CO 

in 
o 
r- 
CD 

Q 


o 

o 
< 

CD 

I 

CL 
< 
> 
Cl 
m 


< 

CO 
Cl 
CN 

co 
m 

CN 
CL 


e> 

or 
O 
u 

<■ 

a 

CO 

< 

CL 

; tn 


I 

CO 

to 

o 
m 
o 

LL. 

< 

CL 
CD 


pir:H72277 


_j 

O 
u 

LU 

J 

. X 
LU 

CL 
O 




sp:AHPF_XANCH 


i- 

00 

< 

LU 

> 

1 

CO 

or 
o 
< 

CL 
«rt 


3 

f- 
U) 

\ 

O 
CO 

or 

< 

CL 

IS, 


pirG70964 




_i 
O 
o 

LU 

1 

o 
> 

CO 

c_ 
in 


LL - — - 
ry Q. 

oS. 


1581 


CO 
CO 


1323 


CO 
CO 


CD 
CN 


1209 


1578 


CO 
T 
CO 


CO 

cn 

CO 


3663 
570 


1026 | 


CD 

r— 


O 

rvi 


cn 
ro 

CO 


CO 
r*- 
ro 


CM 
CM 


To 
c 

1 s 

h- 


1576951 


1578567 


1579449 


1581640 


1582114 


1582273 


1583913 


1535603 ' 


| 1586812 


; 1587573 


1591912 


1591941 


CNJ 

LO 

Oi 

LO 


m 
cn 

*T 

cn 
in 


1595668 


1595844 


cn 

CN 
CO 

cn 
m 


« c? 

'tr c 

C — - 


1578531 


o 

o 

•^T 
Ol 

r*- 

m 


1580771 


1580807 


1581851 I 


I 1583481 


o 
cn 

m 

CO 

m 


in 

■^r 
CD 

CO 

m 


1 1587504 I 


j 1591235 


1591343 


1592S66 j 


1- 
ro 

CO 
CO 

cr. 
in 


1594532 


1595030 


11 596221 


o 

CD 

cn 

•-O 


So ^ 

!(y)2 3 


U) 

in 


5155 


lD 
in 


5157 


5158 


cn ! o 

LO j o 

m ; m 


5161 


CN 
CO 

m 


CO 

to 
m 


TT I LO 

cn ; tr> 
m | lo 


5166 


5167 

i 


5168 


5169 


m 


SEQ 

NO. 
(DNA) 


1654 


in 
in 
o 


1655 


m 

CO 

1 * 


1658 
1659 

1660 


CO 
CD 


j 1552 j 


ro 
to 

CO 


CO 

CD 


LO 
CU 

to 


1666 


i 1667 
, 1660 


cn i o 
CO i r-- 
O . CD 



BNSDOCID: <EP . 1108790A2 l_> 



123 



EP1 108 790 A2 



5 
10 


Function 
bacitracin resistance protein 


oxidoreductase 


lipoprotein 

dihydroorotate dehydrogenase 


transposase 


bio operon ORF 1 (biotin biosynthetic 
enzyme) 


Neisseria! polypeptides preaicieu iu 
be useful antigens for vaccines and 
diagnostics 


ABC transporter 


ABC transporter 


puromycin N-acetyltransferase 


LAO(lysine, arginine, and 
ornithine)/AO (arginine and 
ornithinejtransport system kinase 


methylmalonyl-CoA muiase aipna 
subunit 


15 


Matched 
length 
(aa) 


CD 
CN 
CO 


cn t 

LO CO 
CO CO 


o 

CD 
| CO 


CN 
LO 


CO 

<J> 

T— 


cn 

LO 


LO 
CO 
LO 


CD 
LO 


cn 

CO 

ro 




20 


Similarity 
(%) 

69 4 


62.6 


53 5 

67.1 


55.3 


75.0 


33.0 


68.7 


67.1 


CD 

to 


72.3 


87.5 




Identity 

(%) 

37 3 


33.4 


27.0 
44.0 


^4 7 


44.1 


26.0 


43.6 


36.8 


"V 
CN 
CO 


43.1 


72.2 


25 

CO 

c 

"c 
o 
o 

30 
35 
40 


Homologous gene 


tscneriunid t,un r\ m ua^r-^ 
Agrobacterium tumefaciens 
mocA 


Mycobacterium tuberculosis 
H37Rv1ppL 

Agrocybe aegerita ural 


Pseudomonas syringae tr.pA 


Escherichia coli K12 ybhB 


Neisseria meningitidis 


Corynebacterium striatum M82B 
tetB 


Corynebacterium striatum M82B 
tetA 


Streptomyces anulatus pac 


Escherichia coli K12argK 


Streptomyces cinnamonensis 
A3823.5 mutB 


. — — ■ 

db Match 


_j 

*J s 
5 ; 

< CM 
CO <N 
CL t 
(/) Q_ 


pir F70577 
sp:PYRD_AGRAE 


! 

Q 
< 

m 
o 

CD 
I- 
CO 
UJ 

co 

CL 

Ql 


i 

o 

o 

LU 

i 

CD 

X 
CO 

> 

CL- 


cn 

CM 
CO 

>- 

a 

CO 

U 


prf.2513302A 

i 


prf:2513302B 


|pir:JU0052 


sp:ARGK_ECOLI 


o 
cr 

V- 

M . 
CO 

h- 

CL 
(/> 




ORF 
(bp) 


cn co 
co cn 


999 
1113 


LO O _ 

co oo 


OD ■«- 
co cn 
t m 

i 


o 

CM 

r-- 


603 
1797 


249 

1 DO / 


cn 

LO O 
CO CD 


1089 


2211 




Terminal 
(nt) 


1597745 
1599614 


1600677 
1601804 


1601931 
1603466 

1504629 


1604830 
1505281 


1506689 


1608248 
1505861 


1609335 
1607661 


1509842 
1610844 


1611150 


1512234 


50 


initial 
(nt) 


1598623 
1598667 


1599679 

1 £. 


1602281 
1602660 

1603520 


1605215 
1605811 


1605061 


1607645 
1607657 


1609087 
1609247 


1610192 

1R10236 


1612238 


1614444 






5171 I 
5172 


5173 
5174 


5175 

5175 

5177 


5178 


5180 


5181 

C i O O 

51 32 


5183 
5184 


LO cc 

O QC 
tO Lf 


5187 


5188 


55 


SEQ 

NO. 
! (DNA) 


1671 
1672 


1673 
1674 


m co 

CD ! to LO j 


1678 
1679 


1660 


1061 
1682 


1683 
1684 


1605 


1687 


1688 



BNSDOCID: <EP 1 108790A2_I_> 



124 



EP 1 108 790 A2 



5 
l u 


Function 


methylmalonyl-CoA mutase beta 
subunit 


S hypothetical membrane protein 




hypothetical membrane protein 


hypothetical membrane protein 


hypothetical protein 




ferrochelatase 


invasin 




aconitale hydratase 


transcriptional regulator 


GMP synthetase 


hypothetical protein 


hypothetical protein 


hypothetical protein 


15 


Matched 
length 
(a.a) 


o 

CD 


CN 
CN 




o 
r- 

CO 




CD 
CN 




CO 
CO 


CD 




CO 

m 

CO 




in 

CO 
CN 


CM 
CN 


CO 
CO 


CD 
*T 


20 


Similarity 
(%) 


68.2 


70.1 




87.0 


r- 

co 
r- 


72.8 




65.7 


56.5 




85.9 


81.6 


51.9 


o 

rvi 
x> 


80.2 


86.1 




Identity 
(%) 


41.6 


39.7 




64.1 


44.7 


51.0 




36.8 


25.5 




69.9 


54.6 


21,3 


32.6 


37.2 


61.2 


25 

O 
=J 

o 

L> 
•»«— ' 

CD 

35 


Homologous gene 


</> 

cr 

QJ 
c 
O 

E 

(O 

cr 
c 
o 

co 3 
E 

E uo 

S CO 

Q. CM 

O) CO 

£= co 
CO < 


Mycobacterium tuberculosis 
H37Rv Rv1491c 




Mycobacterium tuberculosis 
H37Rv Rv1488 


Mycobacterium tuberculosis 
H37RvRv1487 


Streptomyces coelicolor A3(2) 
SCC77.24 




| Propionibacterium freudenreichil 
I subsp. Shermanii hemH 


Streptococcus faecium 




Mycobacterium tuberculosis 
H37Rv acn 


v> 

o 

O 
QJ 

-Q 

— O 
c: tt 
- 

3 > 
£ rr 

X) > 

o q: 
u 

>-»co 
2> X 


Methanococcus janraschii 
MJ1575 guaA 


Streptomyces coelicolor A3(2) 
SCD82.04C 


Methanococcus jannaschii 
MJ1558 


Neisseria meningitidis MC58 
NMB1652 


4U 


db Match 


<_) 
cr 
t— 
co 

! 

< 
1 — 
ZD 

o_ 


h- 
O 
> 

1 

ro 

CO 
> 

CL 
cQ 




P 

b 
>* 

5 , 

o 
CO 

> 

CL 


o 

cn 

I Ol 


Cvl 

1 

O 

a 

LO 
Cl 
O 




q: 
u_ 
O 
cr 
a 

M 1 

LU 
X 
a 

tn 


o 
u. 
h- 

^— 
LJ 

1 

*T 

m 
□_ 

Cl 
(/> 




CO 

r— 

OO 

o 
u_ 

Cl 


pir:E70873 


CO 
CO 
T 

CO 
U_ 

C_ 


1 

CM 
CO 

Q 

o 

CO 
bL 

CO 


o> 

T 

CD 
UJ 

CL. 


CO 

1 

in 

tn 

CM 

o 
o 

LU 

< 

bl 
cn 






1848 


CO 
CM 

r-. 


CD 

in 


1296 


in 

CO 


CO 
T 
CD 


CO 
CO 

f>- 


1110 


1800 


CO 
CO 

*r 


2829 


! ^ 

1 CO 

m 


CD 

m 
r- 


CO 
CO 
CO 


r-- c 
to c 

CM C 


1392 


45 


Terminal 
(nt) 


1614451 


1617300 


1617994 


1518321 


1619672 


1620167 


1621838 


1621841 


1623027 


1625428 


1629107 


1629861 


; 1630668 


1630667 


CD C 
CM Li 

CO c 

CO c 
CD C 


1633324 


50 


Initial 
(nt) 


1616298 


CO 

r*»- 
m 
CO 

CD 


'61739B 


CO 

CO 
CO 

CD 


1620106 


1621009 


1621056 


1622950 


1624826 


1625925 


I °> 

• CM 
CD 
CN 
CD 


co 

CO 
CN 
CO 
CN 
CD 


' 1629913 


1631329' 


1631660 


»OJ 1 f*iD 

1631933 




SEQ 
NO 
(a a) 


5139 


CZ 
CTi 

i in 


CO 

in 


5192 


5193 


■c- 
co 

! in 


5195 


5196 


5197 


CO 

cn 
m 


5199 


O 

O 
CN 

m 


! 5 

CM 

m 


5202 


5203 


r in 

3 O 
N CM 

o I tn 


55 


SEQ 

NO. 
(DNA) 


cn 

00 

cc 


1690 


O) 
CD 


I ^ 

I ' 


CO 

cn 
CD 


O 
CO 


1695 


1696 


I 1697 


cr. 
I CD 


1699 


o 
o 
r-. 


1701 


1702 


1703 


t I in 
D o 

- I - 



BNSDOCID: <EP_ _ 1 108790A2_ I > 



125 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




126 

BNSDOCID: <EP 1 108790A2_I_> 



EP 1 108 



790 A2 



5 



10 



25 

CD 
3 



30 



35 



40 



45 



55 



Function 


hypothetical protein 


mtrogen fixation protein 


ABC transporter ATP-bindhg proteir 


hypothetical p'olem 


ABC transporter 


DNA-binding protein 


hypothetical membrane protein 


ABC transporter 


hypothetical protein 


hypothetical protein 




hclicase 


quinone oxidoreductase 


cytochrome o ubiquinol oxidase 
assembly factor / heme 0 
synthase 


transketolase 


cu 
t/\ 
n 

O 

ro 
in 
c 




Malchcd . 
length 
(a.a) 


CM 
LO 




CM 

m 

CM 


r-- 

CO 


ro 
o> 


CM 


co 

to 


r— 

CO 


CO 
CO 
CM 


CO 
CN 




CO 
rr 


CO 
CM 
CO 


LO 

cn 

CM 


m 

CO 


CO 

lO 
CO 








































u 


57.0 


CO 


89.3 


83.0 


73.0 


rr 

r— 


67.8 


77.3 


74.8 


CO 




51.0 


70.9 


66.8 


100.0 


85.2 




to 




































Identity 
(%) 


48.0 


64.7 


70.2 


55.2 


41.0 


CO 
T 


36.3 


50.2 


41.0 


43.0 




23.4 


37.5 


37.6 


i 

100.0 


62.0 




Homologous gene 


Aeropyrum pernix K1 APE2025 


Mycobacterium leprae nifS 


I Streptomyces coelicolor A3(2) 
jSCC22.04c 


Mycobacterium tuberculosis 
H37Rv Rv1462 


CO 

o 

CO 
CO 

o 

O 
C_ 

CL 
<n 
<2 
In 

>s 
O 
O 

jz -cr 
o r- 

4) o 
CO </> 


Streptomyces coelicolor A3(2) 
SCC22.08c 


Mycobacterium tuberculosis 
H37Rv Rv1459c 


Mycobacterium leprae 
MLCL536.31 abc2 


Mycobacterium leprae 
MLCL536.32 I 


Mycobacterium tuberculosis 
H37RvRv1456c 




Pyrococcus horikoshii PHC450 


Escherichia coli K12 qor 


Nitrnbacter winogradskyi coxC 


Corynebacterium glutamicum 
ATCC 31833 tkt 


Mycobacterium leprae 
MLCL536.39tal 




JZi 
T3 


to 
o 
to 

CM 

r- 
O 

9r 
D. 


pir:S72761 


i 

CM 
CN 
O 

u 
to 

Q- 
C7> 


pir;A70872 


CO 
> 

> 
CO 

1 

o 
>- 

id. 
tn 


gp;SCC22_8 


I s - 

CO 

o 
I s - 

LL 

a 


ro 

CO 

r- 

CM 

r-- 

cn 
ex 


CO 

r— 
r«- 
rvi 

CO 
CL 


r— 

co 

o 
r- 
O 

Cl 




CO 
LO 

O 
EL 


sp:QOR_ECOLI 


CO 

U 
CD 

O 
CJ 

5 

CL 

cn 


I s - 

I s - 
CO 
CO 
CM 

o 

CO 
< 
CL 

o> 


UJ 

_i 
o 

> 

J 
< 

cL 

CO 




ORF 
(bp) 


CM 

to 


1263 


CO 

to 


1176 


ro 

TT 


CO 

cn 
CO 


1629 


1 1020 


N- 
O 
CO 


cn 
cn 
tn 


I s - 
m 

CO 


1629 


to 
r- 
co 


O) 
CO 

a> 


2100 


1080 


1164 


Terminal 
(rt) 


1648709 


o 
o 

CO 

to 


1649367 


cn 

CN 
O 

to 

CO 


! 1651433 


1652894 | 


1655671 


; 1656700 


| 1657515 


in 
r— 
CO 

CO 

in 

CO 


1659140 


CO 
CO 

CO 
CO 


1662552 


1662630 


1666502 


1667752 


1099991 


Initial 
(nt) 


CO 
T 

m 

CO 

rr 

to 


1649362, 


CN 
CN 

O 

CO 


T 
CN 

LO 
CO 


1652875 


1653586 


| 1654043 


1 1655681 


; 1656712! 


1557677 I 


cc 
cr. 

cr 
in- 
to 


1659508 


1661578 


1663598 


CO 

O 

to 

CO 


ro 
>— 
LO 
CO 

to 
to 


CO 

r- 
r-~ 
co ; 

-li 


SEQ 
NO. 
(a.a ) 


CD 
CM 
fM 

tn 


5227 j 


5228 


5229 


5230 


5231 


5232 


5233 


CO 
CM 
tT; 


5235 


to 

CO 
CM 

in 


5237 


5238 


5239 


5240 


*r 

CM 
LO 


CM 

in 


LU ^ 2 


to 

CM 


r-~ 

CN 

r- 


j 1728 


o> 

CM 

r- 


o 

ro 

r- 


1731 


CN 

ro 

r*- 


1733 


s 

I s - 


LO 
CO 


1736 


1737 


1738 


1739 


o 

8 s - 


TT 
I s - 


CN 

I s - j 



BNSDOCID: <EP HC8790A2 I > 



127 



EP 1 108 790 A2 



10 



15 



20 



25 



c 
c 

u 
c 

li- 



ra 

x: 
Cl 

o a; 
O to 
-C TO 
CL C 

• cu 
to cn 

to 



U> XJ 



*— c to 

CO 0) ' 



CO 



30 



35 



40 



45 



50 



o 

5 

X 



a> a) 
to to 
o ro 



- "§.£ 

O Q> "o 

0.^ a. 
0) ^ >, 
P £ 5 

- a. £ 
a </> Qj 
a o i/) 

X XT C/> 

O an 



to 2 



P. < 

5 > 

o c£ 



o a: 



2 I 



a> o 

to CO 

a> to 

O T~ 

e >- 

(_> CD 
O CD 

ra cm 
CO CO 



CO 

<f 

LU 

> 



O 

CO 



o 
o 



.2 to 
ai cm 

QJ 

o I— 

u < 



CO 

o 

2,1 SI 

co < 

CL 



55 



LU ^ 

CO 2 



O 



o 



E 

a> 

E 

a> 

X) 
ro 



O) CO 
lO CN 
CM *~ 



CL 

to 



CO 

qJ to 

1 ° 

CD 4— 

P 



O 



£ 

o 

E i 

ra 3 
cntn 

eS 

2 < 

O CO 

o < 



o 
o 

) 

CO 
CL 



>— CO 

TO -r- 

-c: o 

u rr 

TO O 

co > 



10 Cl 
3 cn 
cn 

5< 



CO 

< 

LU 

> 

a 

o 
> 
a 
co 



O CO 

o < 



O 
u 

*' 
CD 

CL 



E 

_2 cn 

Eg 
1 o 



o co 
O < 



(O 

U 
CD 
< 
a) 



CM 



CD 
CM 



O 
CM 



a r> < j ? 



rr 

o 
o 



IS 

5 > 

£i > 

o cr 
o r-- 
>> co 
^ X 



5 > 

<-> a: 

.o > 
o CH 

o 

CO 

S X 



15 

" a: 

TO . 
XJ > 

o m 

u r-- 

>> CO 

2 X 



> 



CM 
CD 



co 
CL 

) 

u 
cr 

> 

ZD 



128 

BNSDOCID: <EP 1 108790A2_I_> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



u 
c 



CD QJ * — 



O 
O 



O 
O 

E 

o 
X 



■5 £ 



53 > 

X3 > 

o Ct 
5? 



\— 
O 
> 



> 



o 



E H 

a 



50 



55 



o 
u 

LD 

I 

CD 
CO 



ro 

<n o w 

TO X Q) 

ro ^ c 



o c 

£ > 
ro ro 



"C to 



CD 



cn 

CD 

oo 



co ^ *, 



O n < 



to 

CM 
IT) 



O 
CD 



CM i 



CO 
CD 



CD 
CD 

CM 



a 

CD 





-O 








CO 












rb 




CO 




aj 




ro 




'e 


US 


mo 




3 


o 


a> 


ro 


c 


XI 


CL 


o 


o 


c 




Acti 


plei 



< 



< 

CO 



CD 
CO 
CD 



cn 
cc 



CD 



CD 
CN 



CD 
CD 

CN 



CD 
CD 

r- 



2* 

S E 

o ^ 
to 

»1 

o cv 
a; ro 

71 ^ 



CD 




TO 




tn 




> 




CD 


Q> 


cei 


CL 


to 


O 


0) 




O CN 






rom 


YJL 


ha 


o 


u 


oo 


CJ 


oo 


ro 


CM 


CO CO 



< 
LU 
> 

LU 
(X 



ID 
CD 



00 
O 



CN 
5 



O 

u 

LU 



CO 



M 

to o 
I § 



a> 

X3 

CO < 



ao 
CO 



B ~° 

EES 

.2 2 co 

CJ 



ro 

« 8 



2 > 

- a: 
> 

2 X 



O 



ro 
u 

QJ 
X 

o 

Q. 
X 



5 > 

ro _ 
X) > 
O IT 

cn 
2 X 



o 



o 



o 



o 

CD 



Xt ±- 

3 E 

e : 



™ > 

-8 a: 

m 
2 X 



CN 
CD 



O 
CN 
O 



CD 
ro 
ro 



BNSOCCID:<EP 1108790A2 I > 



129 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




BNSDOCID: <EP 1 108790A2_I_> 



130 



EP 1 108 790 A2 



5 
10 


Function 


bacterial regulatory protein, orsR 
tamity 


ABC transporter 




ifon{(ll) ABC transporter, 
periplasmic-binding protein 


ferrichrome transport ATP-binding 
protein 


j shikimate 5-dehydrogenase 


hypothetical protein 


hypothetical protein 


alanyl-tRNA synthetase 


hypothetical protein 




aspartyl-tRNA synthetase 


, , . .. . . , ._ 

hypothetical protein 


; 

! 

CU 
tn 
ro 
T? 

o 
o 

cn 

ra 
.c 
a. 
cp 

C 

ra 
a 

ZJ 

cn 


phage infection protein 




transcriptional regulator 


15 


Matched 
length 
(a a) 






































cc 


o 

CO 




CO 
CO 


O 
CO 
CN 


cn 
in 

OJ 


LO 

cn 

CO 


CD 


cn 

CO 


m 
<r 




o> 
in 


r- 
cn 

CN 


cn 

CO 
CO 


CN 

<r 
r- 




CN 

cn 








































20 


Similari 
(%) 


68.7 


73.2 




50.7 


71.7 


60.0 


70.1 


69.6 


71.8 


84.8 




89.2 




53.6 


54.0 




62.0 




Identity 
(%) 


45.8 


35.9 




23.6 


38.3 


o 
o 
m 


CO 


52.8 


43.3 


65.4 




r- 


CD 


CD 
CN 


23.1 




29.2 


25 

OJ 

o 
o 

CD 

35 


Homologous gene 


Streptomyces coelicolor A3(2) 
SC1A2.22 


Corynebacterium diphtheriae 
hmuU 




Pyrococcus abyssi Orsay 
PAB0349 


Bacillus subtilis 168 fhuC 


cn 
i/i 
O 
ZZ 

u 
5j 
.o 

fc 

w LU 
OJ o 

IS ^ 
cc 

_c > 
a cc 
o r» 

f£ 


Mycobacterium tuberculosis 
H37Rv Rv2553c 


Mycobacterium tuberculosis 
H37Rv Rv2554c 


Thiobacillus ferrooxidans ATCC 
33020 alaS 


Mycobacterium tuberculosis 
H37Rv Rv2559c 




Mycobacterium leprae aspS 


Mycobacterium tuberculosis 
H37Rv Rv2575 


Saccharomyces cerevisiae 
S288C YIR019C stal 


Bacillus subtilis yhgE 




Streptomyces coelicolor A3(2) 
SCE6B.13 


40 


db Match 


CM 
CN! 

1 

cv 

< 

O 
cn 

bu 

CD 


CN 

CS,' 
CD 

cn 
O 

< 

ia 

CD 




CT 

S 

m 

."2 

CL 


ZD 

in 
u 
< 

CO 

t 

y 

X 

u. 

CL 


pir:D70660 


j pir E70660 


o 

CD 
CO 

o 

r«v 

LU 
i— 

a. 


UJ 
u_ 

X 

<' 

to 

CL 
</> 


sp:Y0A9_MYCTU 




UJ 

i 

o 
> 

> 

CL 

to 


r> 
h~ 
o 
>- 

1 

o 

CD 

b. 


cn 
< 

UJ 

> 

J 

X 
> 

< 

id 
</> 


Z) 

CO 

o 
< 

CO 

1 

LU 

O 
X 
>- 

CL 

to 




CO 

*~l 

CO 

CD 
LU 
U 
CO 

CL 

cn 




0T 

O ^ 


CO 

o 

ro 


1074 


id 
o 

CD 


uo 
cn 


CO 

m 


co 

CN 
CO 


1167 


CD 

to 


CO 
CO 
CN 


1377 


1224 


OJ 

co 


cn 
co 


2676 


165/ 


CO 
CD 


vr 
CO 

m 


45 


Terminal 
(nt) 


CO 
CM 
-5" 

OJ 

r» 


1722853 


| 1722202 


1723826 


CO 

r»- 

U"> 

OJ 

IV- 


1724612 ; 


1725459 


1726625 


| 1727385 , 


1730166 


1731599 


1732986 


CD 
T 

cn 
in 
co 

r-- 


1736004 j 


1738713 


CM 

m 
o 


1741906 


50 


ro — 

i 
i 


in 

CN 
• — 

fN 


o 

CO 
OI 


! 1722807 


1722870 


1723826 


cn 

CO 

T 
IT; 
OJ 


1725625 


1727170: 


CO 
*r 
O 

o 

ro 


1731542 


, 1732922 j 


1 

CO 


1735056 1 


1738679 | 


1740559) 


1741219) 


1741313 j 




S> 2 5. 


5294 


5295 


5296 


5207 


5299 


cn 
cn 

CN 


5300 


o 

CO 

in 


5302' 

! 


5303 


^* 
o 

CO 

m 


! 5305 


5306 


5307 


5308 


5309 


o ! 

co ' 
m ■ 


55 


2oI 

I C/) ^ o 




1795 


CD 

cn 


s 


1798 | 


cn 

cn 
r- 


o 
o 

CO 


§ 


IN 

o 

CO 


CO 

O 
00 


o 

CO 


cr> 
O 

CO 


1806 


1807 


co 
o 

CO 


cn 
O 

CO 


o 1 

5 1 



BNSDOCID: <EP __1l08790A2_l_> 



131 



EP 1 108 790 A2 



5 
10 




Function 




oxidoreductase 




NADH-dependent FMN reductase 


L-senne dehydratase 




alpha-glycerolphosphate oxidase 


histidyMRNA synthetase 

i _ — 


hydrolase 


cyclophilin 




1 

hypothetical protein j 

. . 1 


I 
I 


GTP pyrophosphokinase 


adenine phosphoribosyltransferase 


dipeptide transport system 


hypothetical protein 


protein-export membrane protein 




15 




Matched 
length 
(aa) 




ro 




CD 


CM 
CD 
T 




CO 
CD 
LO 


! 

cn i 

^ I 


CN 


LO 




CO 
CN 




o 

CD 


CO 
CO 


CD 
T 


CO 

LO 

LO 


CM 
ro 

CO 














































20 




Similarity 
(%) 




88.1 




77.6 


r- 


I 


53.9 


72.2 


62.1 


61.1 


! 


100.0 




CD 
CD 
CD 


100.0 


98.8 


CD 
O 
CO 


57.2 


— 






Identity 
(%) 




72.8 I 




37.1 


46.8 




28.4 



CN 
CO 


40.3 

. 


TT 

in 

CO 




98.4 




99.9 


995 


98.0 


o 

CO 


25.9 




25 
30 


Table 1 (continued) 


Homologous gene 




Cn 
CO 

< 

o 
o 
u 

o 
cj 

<A 

aj 

o o 

^ CO 

E r 

O LO 

CO CO 




Pseudomonas aeruginosa PA01 
slfA 


Escherichia coli K12 sdaA 




Enterococcus casseliflavus glpO 


Staphylococcus aureus 
SR17238 hisS 


Campylobacter jejuni 
NCTC11168Cj0809c 


Streptomyces chrysomallus 
sccypB 




Corynebacterium glutamicum 
ATCC 13032 orf4 




Corynebacterium glutamicum 
ATCC 13032 rel 


j Corynebacterium glutamicum 
ATCC 13032 apt 


Corynebacterium glutamicum 
ATCC 13032 dciAE 


Mycobacterium tuberculosis 
H37Rv Rv2585c 


(Escherichia coli K12 secF 




35 
40 




db Match 




CO 

i 

LU 

O 
CO 

c. 

CD 




LU 
< 
LU 
CO 

a. 

<■ 

LL. 
_J 
CO 

Cl 
m 


_j 
o 

CJ 
LU 

I 

X 
Q 
CO 

d. 




< 

CN 
CD 
CO 

ro 
CN 

*«r 

CN 

t: 

CL 


i— 

CO 

I 

X 

> 

CO 
Cl 
t/) 


gp:CJ11168X3J2 
7 


prf:2313309A 


I 


1 

LO 

CO 
CO 

ro 
a 

Ll_ 
< 

a, 
CD 




ro 

1 

to 

CD 
CO 
CO 

o 

LL 
< 

id 

CD 


CN 

1 

LO 

CD 
CO 

ro 
O 

LL 
< 
CL 

cn 


co 
CO 
CO 
CO 

o 

< 

CD 


h— 
O 
> 

1 

O 

CQ 
o 

> 

Cl. 

CO 


_j 

O 
a 

LU 

1 

LL 

u 

UJ 

CO 

CL 

(/> 








ii 


! ^ 


1113 


CD 

CN 

"~ 


LO 

cn 


1347 


CD 
co 


1685 


1287 


CD 

ro 

CO 


o 

CO 


r- 
ro 

CN 


LO 
LO 
CO 


CM 

ro 


2280 


LO 
CO 
CO 


O 
LO 


1743 


1209 


o 
ro 
CO 


45 




Terminal 
(nt) 


1742606 


1743813 


1743968 


1744519 


1746230 


1747588 


1746233 


1747990 


1749325 


1750933 


1751200 


1 1752051 


1752527 


1752615 


1754925 


1755599 


CO 

CO 

t 

co 
cn 
r^ 


1757589 


1760336 


50 




Initial 
(nt) 


1741893 


1742701 


CO 

N" 
oO 
ro 


1744025 


1744B84 


1746728 I 


1747918 


1749276 


1749963 


1750427 


1750964 


1751497 


' 1752186 


j 1754894 


1755479 


co 

JO 


1757228 


1 758797 


o 

CT. 
CO 






2§ * 


5311 1 


5312 


53131 


5314 


5315 


5316 


5317 


5318 


5319 


5320 


5321 1 


5322 


5323 


N 
CN 
i CO 
LO 


U") 

(N 
ro 

CO 


'. UJ 

I ^ 
m 

I LO 


5327 


5328 


SI 






LU y 2 


1Blfi 


CM 

cn 


' ro i tt 
j no co 


LO 


1 

j CO 


r-- 
co 


CO 
CO 


CD 
OO 


1820 


CN 
CO 


CN 
CN 
CO 


ro 

cn 

CO 


1824 


j 1825 


1826 


1827 


I 1828 


Rl 



BNSDOCID: <EP 1 10879OA2_l_> 



132 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



c 

3 



i <d x: 

nj qj - — 



o 
o 



0) 
."3 



E 
o 
X 



c ^ 
5 § 

CJ o 

' >* 

H 

si 

U 01 



0> 

p 
a. 



o 

GO 



cn 



03 




TO 




LL 








E 






O 


a5 


o 




tn 


ac 


CM 


qc 


a> 


o 


o 









CJ 
>- 

! 

Q 
co 
o 
>- 



O 



50 



■ So? 

ro 2 «?. 



55 



Sof §1 



CD 



O — 



w >- *-* 



5 3 m 

m - 



_ O 

o> u x: ^ 
r a a 



O 
CJ 

LU 

i 

CJ 

> 

cr 



O 

u 



u 

CD 
LU 
> 



CJ O) 

° < 

«£= a 

CO CO 



CO 
CD 



</> 

— to 
O C 
o ™ 



£ o 



a> 

a> 

c 
>» 

< 



o 







'</) 








ZJ 




o 




<D 














O 


E 


CT> 
O 




CO 


j5 


CN 

> 



JD > 

o or 

2 X 



to 



p 

CO 



a 

CO 



oo 



li. 

^ to 

£o 

o oo 

u oo 

ro cm 

CO CO 



CO 
< 
LU 



CL 

O 



E5 



~ cj 

CO CO 



CO 

o 



o 

CO 



e < 

<u to 
cn 
3 °- 
~ u 

£ CN 

.=? s 

-Q > 

o cn 

^ CD 
2 X 



(J 

CD 

X> > 
O LY 

IS 



Q 



3 
CO 

o 
< 



o 
to 
to 



CM 
CM 
O 

a> 
<o 



CD 



to 
cn 
to 



I in 1 cd 



BNSDOCID: <EP 110879QA2 I > 



133 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



<tl 



"(75 



e =8 



GJ 
C 

cn 



cn 

o 
o 

E 
o 
X 



40 



50 



co I '<r 

CO I LO 



55 



° O 
LU y 

LU U Z i 
CO 2 Q 



CO | CO CD I 



CO - *~ 
- O 
O *~ 



o 

Q. 



« ° 

3 £ 

~ Z3 

O O 

(0 c 

-O CL 

o o 

•S 5 

< Q. 



E 




tn 




~o 




-Q 




rg 




"53 




E 




o 




ro 


tr 


c 


0J 


CD 






"o 


o 


CL 




o 


c 


> 






CL 





00 

o 



CD ! 



(O N GO 
CD to (D 
CO ' CO I QO ! 



134 

BNSDOCID: <EP 1108790A2J_> 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



c 
o 



i2 



5* 



c 
o 



-a 



50 



55 



8! 



LU 
CO - 



! ^ 



in 



; <o 



v ! in ; 

M N ! N ! N 

n I r, I n : n 
in I m ; in | in 



co : cn 

co co 
cn m 



■ m ! id J r» I cj 1 cn 
* — • s — I • | 
j o | a) i co | a) 

J — ! 1 1 1 ! I 



CO | CC 



o 



O 



It 



CM I CO 
CO | CO 



I IT) tO 
CO j CO j CO 



co i co 1 co I co 
in j m ' m i m , in 



in id 



LU 



to 

> 

s - 

v> o 

CD CO 

O CM 
>>CD 

E or 

S > 

2 a 

o co 

O CO 

CO CM 

CO CO 



CO . CO 



CO • O 
O I <r- 



< 



X 

> 
fi- 



eri 
<o 

CM 



o 

CO 



T : lO ■ 

lo in [ 
o — 



CO 



o I 

cn i 

. CO * . 

I m S m I m 



cn ct> | cn 



cn o ! r\j 
co , cn o 1 o 

co I CO i CQ j QO 



CN I 



135 

BNSDOCID:<EP _ 1108790A2 I > 



EP 1 108 790 A2 



5 
10 


Function 


sporutation transcription factor 












I 

i 

I 






hypothetical protein 










hypothetical protein 


insertion element (IS3 related) 


insertion element (IS3 related) 


j 




single-stranded-DNA-specific 
exonuclease 




primase 


15 


Matched 
length 
(a.a) 


CO 

cn 


















m 

rr 

m 










CD 
CO 

T — 


co 
00 

CM 


0 
t— 






CN 
CN 
CO 




CO 

CO 


20 


Similarity 
(%) 


in 
to 


















55.2 










75.0 


95.6 


CN 
rr 

CO 






CO 

0 

LO 




CO 

rr 

CD 




Identity 
(%) 


34.3 


















22.6 










63.0 


87.9 


72.3 






rr 

CN 





31.8 


25 

f 
3 
C 

o 
(D 

35 
40 


Homologous gene 


Streptomyces coelicolor A3(2) | 
whiH 






— - 








i ._ 




Thermotoga maritima MSBB 
TM1189 










Corynebacterium glutamicum 


Corynebacterium glutamicum 
orf2 


Corynebacterium glutamicum 
orfl 






Erwinia chrysanthemi recJ 




Streptococcus phage phi-O1205 
ORF13 


db Match 


CO 

I 

X 
X 

% 

CO 
< 
O 

in 

CL 

cn 














1 

1 


pir.C72285 










rn 

CO 

O 
to 
CO 

or 

CL 


pir:S60890 


pir S60889 






X 

0 
or 

UJ 

1 

— > 
0 

UJ 

or 

CL 




pir:T13302 




LL ■ — 


73B 


0 

CO 


CO 

LO 

rr 


i co 

I CO 


CN 

r- 

CD 


rr 


m 

CO 


CD 
CO 
CO 


O 

CN 


2202 


CD 
rr 


0 

CN 


rr 
rr 


CO 
CN 
rr 


rr 

co 
m 


rr 
CO 
CO 


rr 

00 

CN 


CO 
CN 


1299 


1878 


0 

CO 

r- 


1650 


45 


Terminal 
(nt) 


1814517 


1815651 


1815128 


1816636 


1817803 


1818219 


1818774 


1819166 


1819748 
1820181 


1824322 


1824589 


1824927 


1825178 i 


1826557 


1825751 


^r 

CO 
CO 
CN 
CO 


1829688 


1832063 


N 
rr 

0 
rr 

CO 

co 


1834149 


VJ 

CN 
CO 
CO 
CO 
CO 


50 


Initial 
(nt) 


1813780 


CO 
CD 
CO 

rr 

CO 


1815673 


1816451 


1817132 


1817803 


18184601 


18187981 


rr 1 CN 

> m co 
; cn ■ co 

CD 1 CN 
t- | CN 
CD ! CO 


1822577 


1824371 


1824784 


I 1825606 


1826024; 


1826644 


1826937 


1829900 


1830765 


1832167 


CN 
rr 

OJ 
CO 


1836675 




CO ^ 2 


to 
cn 
: ro 

LO 


5397 


539R 


5399 


0 
0 

rr 
in 


5401 


5402 


5403 


5404 
5405 


5406 


5407 


540B 


5409 


5410, 


^r 

LO 


5412 


5413 


5414 


I in 

rr 
in 


5415 


rr 

1 *o | 


55 


uj O 2 


to 
cn 


r-. 
cn 
cc 


CO 

0 

CO 


cr 
cr 

CC 


O 
O 
CD 


I 1901 


CN 
O 

cn 


CO 

0 

Oi 


1904 
1905 


CO 

0 

03 


1907 1 


CO 

0 

CD 


h909 


1910 




CN 

cn 


CO 
CO 


l = 
I s 


1915 


\ 

I 1916 


r-- 
00 



BNSDOCID: <EP 1 108790A2J_> 



136 



EP 1 108 790 A2 



5 
10 




Function 






1 
i 


helicase 




'c 
a 
c 
a 
o 

Q 

uo 
aj 

CT 
«J 

XT 
Q. 


5 
> 


















actin binding protein with SH3 
domains 








ATP/GTP binding protein 


^TP-dependent Clp proteinase ATP- 
Dinding subunit 


15 




Matched 
length 
(a a) 








o 
cm 
to 




OO 
O 




















CN 
CN 
*X 








CO , 


o 

CO 
CO 


20 




Similarity 

[%) 








i — 

T 

V 




CN 

to 




















CO 

mi 
T 








52.5 


o 
to 






1 Identity 

| (%) 








22.1 




36.7 


















i 


28.7 j 








23.6 


30.2 


25 
30 
35 


Table 1 (continued) 


Homologous gene 








Mycoplasma pneumoniae ATCC 
29342 yb95 




Bacteriophage N15 gene57 




! 






i 










Schizosaccharomyces pombe 
SPAPJ760.02c 




! 


1 


Streptomyces coelicolor 
SC5C7.14 


Escherichia coli K12 clpA 


40 




db Match 


i 






z 

Cl 

o 

> 

co« 

o 
>- 

Q. 
tn 




T 

*rr 

CO 

h- ; 
Q. 








1 












J. 

CN 

1 

O 
CO 

r^. 
~-> 
CL 
< 
CL 
CO 
bl 

CD 








U 

UO 

O 

CO 
Ol 

CD 


sp:CLPA_ECOLI I 






it 


3789 




co 
to 


1839 


LO 
CO 


CD 
CO 
CO 


to 

CD 
CO 


I 

CO 


r-- 
CO 
LO 


CO 
CN 
UO 


CO 
CD 
r- 


to 

CO 


CN 
CO 


CO 

ro 

TT 


CD 1 

| 

LO 


1221 


CN 
UO 
CO 


1395 


o 

OO CO 
LO <r- 


1257 . 
1854 


uo 
CD 
rn 


45 




Terminal 
(nt) 


1842137 


CO 

to 
rr 

CO 


1843337 


1845356 


1845857 


1846207 


ro 
ro 
ro 
to 

co 


(M 
CO 

cn 

r*~ 

CO 


^ 

CO 
T 

CO 


1849036 


UO 

cc 
r- 

CD 

cc 


to 
to 

o 
cn 

T 
CO 


CD 
O 

o 

LO 
CC 


1849978 


1850474 


o 

CN 

LO 
CO 


1852324 


1853873 


1854854 
1855237 


1856788 
1858738 


1860727 


50 


i 

i 


Initial 
(nt) 


1838349 


LO 
CO 

Cn 
CM 

CO 


o 
cc 
cm 
■v 

QC 


CO 

Co 
ro 

XT 
CO 


| 1845483 
r— 


| 1845872 
1 


oo 

CO 
ID 
ID 

co 


LO 

ro 

OO 


CO 
ro 
OO 

so 


CO 

o 
J - ) 

CO 

ao 


CO 

CO 

oo 

CO 
<T 
XJ 


CO 
CO 
33 


m 
to 
o 
o 

00 

ao 


1850415 


1851049 


1851220 


CO 

" — 

JO 
3D 


CD 

r- 
■^r 

CM 
LO 
CO 


CO 

ID H-) 

rN o 

TT CO 

jo m 

QO CO 
■ — i— 


CM UO 
CO CO 
UO CO 
1 UO CO 

; uo uo 

, CO CD 


1858763 




i 

L 
i 




□o 
rr 

uo i i 


CD 
«~ 

-O t 


o i 

-N 

KT ! 
^ I 


5421 


rvi 

CM 

JO « 


ro 
rvj 

jo 


v i 

M ( 
rr - 
r> i 


n | < 

"N ( 

r> |i 


£> < f 
■N , ( 

r> | i 


*- | CO < 
"N CN ( 

iO | uo V 


3> 
"N 
«T 

n i 


3 j 

-o 
«r 

JO 1 \ 


ro 
no j 


CM 

ro 

Sj 


CO 

ro < 

UO 4 


*T 

"O 
T" 

O < 


•o < 

JO i 


D ! 
■o CO 
vr I 
n j uo 


co cn 

CO 1 CO 
j LO LO 


5440 


55 


I 


°n if 

w 2 n 
— ' i 


ao 1 < 
C7> 1 ( 

i 1 


3> O 
«- 1 rvj 1 

3) • CD j 


1921 


™M < 
*N < 
J) < 


-o ! 

M ' ( 

=>j< 


XT | l 
M < 
J) i < 

~ 1 , 


3? l S 

~r 


o r 

M f 

d < 


^ CO c 
M CN ( 

d | cn | c 


D < 
M ( 
D < 

-!- 


3 I 
O 1 < 


- I 




1933 


<T l 
O < 
7> < 


r> < 

•o ( 

7> < 

r— i 


O CO 


co cn 
ro ro 
cn cn 


I o 



BNSDOCID: <EP 1 108790A2.I > 



137 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



■5 



to 



o 
cn 

o 

e 



CL 



CO 
CD 
CD 



O 



in 
cm 







t 




m 


CM 






CO 


m 


m 


CD 


CN 






CM 










1 


5 


CO 





° o 

UJ ^ 
CO ^ 



o < 



< 

o 

CL 



lo ^ a j ^ 



CO 



i S 
a. 
0) X 

I o 
co co 



CL 



.5 "» 

o ro 

o £ 

v £ 

Z= >> 

a, £ 

CL CU 



x 
o 

to 



o I CO 

CO j CO 
CO CO 



O ] CD j CD CD 



r-- I cm 

CO I CD 

CO CO 
CO 



□o 



<d ; CD j 



O 

in 

CD 



in 

CD 



II 

o }— 

o < 



CD 
CD 



E 
o 
E 

03 

Qj CN 

g s 

O h~ 

o < 



Qj 
O 

a. 



< 
o 



o 

CN 



< 



CO I CO CO ! 



ft) 

U CJ 
5^.CD 

£ - 

O CN 
Q_< 
CJ ^~ 

-b O 
CO CO 



CO I CO 



CO 1 CD 



O 

CO 



co } 



m I m i m 

CD I CD i CD 



in 

CD 



cn | 



CO 
CD 



BNSDOCID: <EP 1 108790A2. 1_> 



138 



EP1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



45 



S9 ^ 
CO 



m 
to 

S c 

CO Cl 



o 

CL 



E 
o 

X 



O CO 
O LO 
O OJ 

c 

o cr 



tn r\i 

o CM 

a ^ 

re o 



S -g i 



o 

LU 

< 



ID 



50 



55 



LL — 

at 



CO 
CO 
CO 

*- 



So - 

CO 2 .2. 



CO 

co 
o 
co 



o 



O 0 < 

CO 2 Q 



o 

o 

LU 

I 

CO 

a 



o I co 



oo 
to 



Q I 2 J? | CT> j C; 



to j m 



CO ! ^ I 



ro J co 

in • t~ 

ao ; cr> 

O) | o 

cr, o 

co cr> 



0) 

o 



o 

CD 



CO 



CO 

cnj °* *~ 



O 0).O) 



o i 

GO CO 
- I CD 



o 






Cnj 




s 


CO 




CM 


c? 




CO 
















•| 




co 




m 


co ' 




CO 




l| 






m j 


m 



CO CO j 



BNSDOCID:<EP 1108790A2 I > 



139 



EP 1 108 790 A2 



10 



15 



20 



25 



(O V ' — 



CO 



30 



35 



40 



45 



50 



3 



C 

o 
o 



0) 
,CTS 



d: 
o 



2c - 

a) ^ e 



55 



O ^ < 

LU ^' ^ 
CO 2 Q 



E 

O 



CD i CD O 



in i in | in 



CTl I CD j CO I CJ) 
CD ! CD J CD I CD 



O 
Q- 

15 



CO S. 

5 > 
o cr 

u r-- 
>>ro 
2 X 



CD 

o 



CO I r- 



r- \ CM 
O i o 



o t~ 
o o 
o o 



O 
IC 



O 
O 



CD 

m 







o 




</> 




ro 








c: 












</i 




n 




o 




o 




o 




o 




ano 


ro 




5 




— > 


2 





o 

cm m 



00 
CO 



CD ) CD 



O 



CO CD 
O | O 

mm; 
in I in i 



o i 



BNSDOCID: <EP_ 1 108790A2_I_> 



140 



EP 1 108 790 A2 







i 
























tein 












i PS1 protein 




5 




c 
o 
























in PS1 proi 














10 




Fundi 




j 






J 








surface protein 






i 

major secreted protei 
precursor 






E 
c 

b 

CL 

o 
< 
Q 






major secreted proteir 
Drecursor 




15 




Matched 
length 
(aa) 








1 




i 






I 

I 






o 

OJ 






i 

o- 
cn 

LO 






rr 
CO 




20 




Similarity 
(%) 
















i 

i 


rr 

rr 






rr 

LO 






o> 
o 

LO 






• rr 
; in 








Identity 
(%) 


















23.0 






30.7 






23.8 






cn 

OJ 




25 
30 
35 


Table 1 (continued) 


Homologous gene 


















Enterococcus faecalis esp 


j 




Corynebacterium glutamicum 
(Brev/ibacterium flavum) ATCC 
17965 cspl 


1 




Escherichia coli topB 






Corynebacterium glutamicum 
[Brevibacterium flavum) ATCC 
17965 cspl 






Match 
















I 


< 

i 

CO 
rr 


j 




_C0RGL 






.ECOU 




i 


CORGL ( 




40 




db 


















cn 
o 

CO 
CM 

if 

Ol 






sp.CSPI 






sp:T0P3_ 


! 


i 


sp:CSP1_ 








LL — 
rv CL 

o-S- 


1191 


rr 
co 

uo 


Si 


j 444 


ro 

in 


CO 

o 

CO 


CD 
OJ 


cr> in 

O CO 

ro co 


CO !^ 

oj cn 

CO OJ 


CO 
CO 


cn 

OJ 

rr 


1581 


2430 


CD 
CO 


2277 
2085 


*- OJ 
cn ro 
co 


r- 


CO 
CO 


o> 

OJ 


45 




Terminal 
(it) 


1940135 


1938531 


"TT 

rr 

CO 

O 
cn 


o • 

m ' 

LO 

cn 


1941732 


OJ 

CO 
OJ 
T 
CD 


1943310 


CO tt 
ID (D 

cd m 

CO rr 
T tt 

cn cn 


CO LO 
O CO 

co un 
kt m 
rr -a- 
cn m 


1945952 


cn 
o 

CD 
CO 

rr 
cn 


1947070 


! 

CN 

o 
cn 
rr 
cn 


1951619 


1952546 
1956203 


o m 
LO vo 
rr r— 
co cn 
lo m 
cn cn 


1960371 


rr 

CO 

cn 


1963139 | 


50 


i 


ro - — 


1936945 


rr 
ID 
o 

CO 

ro 

cn 


"N 
O 
rT 


r^. 
O 

«— 
rr 

CO 


T 

00 

r\> 

«T 
31 


o 
cn 


un 
cn 
o 
ro 
rr 

cn < 


1943345 
1943680 


1945435 
1945891 


1946332 


51 

S l 

cr> . 

^ i 


o 
m 
cn 
co 
rr 
cn 


o 
jn 
rr 
«— 

JO | 

cn < 


Ln 
cn 

<T 
*M 

n 
3i 


oi 

oj ro 
cn rvj 
rr co 
n m 
cn cn 


1959340 
1960196 


rr 

CD 

cn 


1963000 


CO 
f>J 

■o 

£> 
33 




l 


2 o ^ 


o 

m i 


— 

n t 

n t 


s . 
n (t 


r> 

O I 

n i 


<3" 

n i 
n i 


n 

n i 
n ; 


n r 
n i 

D i 


5518 


5519 


! oj ( 
i in i i 
j m j i 


V 1 

=4- 


5523 


T ' I 
"M . ( 
-O • I 
-O « I 


n 
>j 
n 

n t 


3D26 | 

5527" 


co j cn 

CN j (M 

m tn 
to j m 


5530 


5531 


O 

n 
n 


55 




o n <i 


o ; — < 
oio < 

"M j CM ( 


N ( 

5 < 

M c 


o - 

Nl C 


?T l 

3 ( 
M < 


n ( 

^ < 


O f 

D C 
N C 


- I co 
•m I o; 


I cn o 

j - OJ 

o o 

i OJ OJ 

1. 


2021 


\i 1 

?! 


CO | - 
CM t 
O C 

OJ c 


* ! v 
n|c 


O i i 
\i I ( 
D < 
>J C 


D • o- 
x 1 Oj 

\! , CN 


co j cn 

« Ol j OJ 

loo 

OJ j OJ 


2030 


2031 


o 




































1 1 L_ 





BNSLXXID: <EP_ 110879OA2 I 



141 



EP1 108 790 A2 



c 

ZJ 



10 



CO 



Q 

Qi 

c 
CO 



CD 

c 



CL 



?5 



"5) rt 



ID 

rsi 



20 



25 



30 



CD 



c: 
o 
o 



,co 



CD 

O 

o 

E 
o 
X 



35 



40 



45 



50 



55 



St 



E 



So - 



m . in i in 
m un | in 



m O 2 i 



c I o 

CM | CN 



j cn I cn j o I 



8 



OlOlOi 
CN ' CN ; CN 



CD 



o : o ! o 

CN j CN | CN 



Q 



CL 

CO 



< 

o 



! CN 

CL 



o 

CM CO 

cr> i cd 



CD 
i j CD 



in i 10 in 



csi I 



CD : - 
CD . O 

*- i CN 



m > in I in Un 
in I in in I in 
m in J in j m 



n I \r m | 
in in , m 
o | o o : 

CM CN CN 



mm! 
o O ; 

CN ' CN 



142 

BNSDOClD: <EP 1 108790A2_L> 



EP 1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



r ea- 



rn <D 
2> - 



i? : 



,CTJ 



50 



55 



T3 



K _g ; co 
O - I ™ 



| CO 



QQ 

CO 



co 

o cn 



C — CO 



CO ^ *L 



G 0 < 
m U z 
to 2 u 





CO 










CO 


CD 


CN 


n 


CO 


XI 


CO 


co 


CO 


CD 


cr> 


Ol 








o 




CN 


CO 


CO 


CO 


m 


to 


m 


in 


in 


in 



O ; 
CO CO j 
O : O 
CN CM 



> 



o 



±± CN 

> _J 
2? CD 

CO CJ 



! M 



i 2 



JO 
o 

E m 
■E to 



CM 
> -J 

£ CD 
m o 



cn 



o 
SB 

E 5 

m o 



ro 

a> 
c 

o r- 
O o 



o 



o 
m 



CO 



CN | CM fN 



CO ! CO 
O j O 



CO 



o 

CO 



CO 



CO 
CO 



E - 
o 

- — > 

co cn 



co 

CN 
CO 



O 
CJ. 



LO 

a 



a> 

CD 

O w 

CJ o 

en in 

5 3 

co a> 

E cL 



in 

CN 



e8 
Ie 

3 - a 

c3 5° C= 



CO O) 

CO I CO I 

o o 

CN CN ! 



CO 
OJ 
CO 

CD 



to I 



o 

CN 

O I «- 
o> I O 

0> I CD 



o 

o 
u 

I 

Q_ 
CO 

u 



m I m ; m I m 



i I co 

i ■ o 

t I CN 



! co ! 



o 

CN 



I ° * 



143 



BNSDOCID: <EP 1 10879OA2 I > 



EP 1 108 790 A2 



5 
10 




Function 


sodium-dependent transporter 
hypothetical protein 


riboflavin biosynthesis protein 


potential membrane protein 
methionine sulfoxide reductase 


hypothetical protein 


hypothetical protein 


c 

t 
- 

u 

Q ' 

CD 
t/i 

ra { 
_C0 

o 

c 
o 

-O 


u 
0 

5. 
/i 

2 
5L 
o 
i 
1/1 
o 

13 
>^ 
X 

? «> 

>. lO 
X CO 

c x: 
cj ~ 

? £ 

T— CO 


RNA methyltransferase 


hypothetical protein 


u 

Ti 

c 
Cl 

in 
O 

c a> 

CL 1/3 

Z ro 
~ O 

0) 

|l 

o o 
"O c 


hypothetical protein 


15 




Matched 
length 
(aa) 


00 CM 
CO CO 


CO 

ro 
CN 


-ST CO 

CO CM 

co t- 


CN 
CO 
CN 


T— 

o 

CN 


CO 


CO 
CO 


CN 


CO 

CO 
CN 


o 


o 

LO 


20 




Similarity 


76.1 
61.5 


64.4 


CD LO 

<«- r- 
CO 


CN 

r-- 


CO 
CO 


CO 
CN 
LO 


78.5 


ro 

CN 
LO 


CN 
CO 


CN 
CO 


70.7 






Identity 
(%) 


CO CO 

oS CO 
ro 


33.5 


to CO 
CN ^ 


55.2 


55.7 


25.9 


55.3 


25.4 


38.1 


55.0 


o 

CO 


25 
30 


Table 1 (continued) 


Homologous gene 


Helicobacter pylori 26595 
HP0214 

Bacillus subtilis yxaA 


Mycobacterium tuberculosis 
H37Rv Rv2671 nbD 


Mycobacterium tuberculosis 
H37Rv Rv2673 
Streptococcus gordonii msrA 


Mycobacterium tuberculosis 
H37Rv Rv2676c 


Mycobacterium tuberculosis 
H37Rv Rv2680 


Haemophilus influenzae Rd 
KW20 HI0390 rnd 


j Streptomyces sp. CL190 dxs 


Thermotoga maritima MSB8 
TM1094 


Mycobacterium tuberculosis 
H37Rv Rv2696c 


Streptomyces coelicolor A3(2) 
SC2E9.09dut 


Mycobacterium tuberculosis 
H37Rv Rv2698 


35 








— 








1 












40 




db Match 


ZD 
CO 

o 
< 

CD 

lo 3 

CD >< 

LL > 

.U: CL 
cl in 


CD 
CO 
CO 

o 
r- 
CJ 


CN 
CO 

00 £ 

CD 00 
CO CN 
CD *~ 

p- u_ 
LU < 

.t= o 

CL C 


pir;H70968 


pir:C70528 


LU 
< 

X 

(X 
cL 

cn 


T i 

CO 
CO 
CO 
CN 

o 
trs 
< 

CL 
CO 


CO 

CO 
CN 
CN 

LU 
Cl 


pir:C70530 


o 
(J 
cr 

h- 

CO 

1 

K~ 
ZD 
Q 

iei. 

CO 


o 

CO 

LO 

O 

r-» 
LU 

Ol 






si 


CO (N 
O ro 

CO TT 


LO CO ', CO 

ro co 
co co to 


2 « 

CN § 


CO CO 
CN CO 
CO 


CN 
CO 


1263 


1908 


1236 


CM *~ 
CO CO 
CN CO 


t 


co 

LO CM 


45 




Terminal 
(nt) 


1995783 
1996537 


1997112 
1997503 

1998240 


cn a 

LO O 

cn o 
a 

co o 


1999707 
2000521 


2002112 


2003334 


, 2003402 


CN 
(O 

LO 

o 
o 

(N 


2006979 
2006777 


2007738 


2008798 
2008876 


50 




initial 
(nt) 


1995083 

1 3 Z? \J \ CV -J 


1995758 
1997165 

1 0Q7C/1C 


1998289 


I J*1i 

2000132 
2001216 


CO 
CO 

o 
o 

CN 


2002072 


2005309 


2006697 


20066G8 
2007637 


2008184 


2008250 
2009082 






° o ' 


- CO J o 
re co 
ro lo to 


5581 
5582 

5583 


5584 


n co | 

o CO I CO 

r> in : lo 
o lo m 


5588 


i co 

CO 
' LO 

! to 


I 5590 


5591 


5592 
5593 


5594 


5595 
5596 


55 




2o; 

CO 2 ' 


| (UNA) 

2079 
2080 


2081 
2062 

2083 


I ^ 

! CO < 
| CN |, 


2086 
2087 


2088 I 


2089 


I 2C90 


j 2091 


| 2092 
I 2093 


2094 


LO CO 
CO CO 
CD O 
CN CN 



144 

BNSDOCID: <EP 1 108790A2_L> 



EP 1 108 790 A2 



w 



15 



20 



3 



" c oj 
ra <u — 

2 ~ 



25 



30 



35 



40 



45 



CD 



00 



o 

cn 

O 
E 



II 



o 

CO 



Is 
I? 



o> I 



CD 
















E 








O 




tx 




< 




cr 






o 




o 


on 


o 




ip 


o 






ro 


u 


E 


ns 


cn 


ro 


CD 





o 
-O > 

o cr 

:>> CO 
S X 



50 



55 



° O 

LU W 
CO 



2 2. 



o 
co 

cn 



O 
a 
a 
a 



2r< 
o cn 
O 55 



1 S 

i CO 





CM 


cn 


CO 




CO 


CO 




o 




o 


O 


CM 


CN 



QO 
I s - 



O 

a. 



E 



E £ 

.O > 

o £T 



cn o 
cn ■ o 
u> CD 



E °> 

5 > 

o 

-D > 

o cr 



E 00 



2£ > 

-D > 

o cr 

o r-- 
j^co 



o 



O O 

V) v> 

</» »/> 

a. a. 



o 
o 



II 

o .9- 3 



CN 
CN 



E 

CL 



CL 

Q 
D 



CO 



>-. o 
E 00 
o ° 

a> X 
^ O 
CO CO 



o 

ID 



o 

CD 



O 
CN 



CD 

CD 
CO 



CN 
O 



U0 

o 



I 
o 

CO 

CL 
CO 



E cr 

•§ ° 

a3 Co 

«8 



ro 



O h- 

o < 



o 

CO 



o 

CN 
CN 

Tf 
CL 



O 
Pi 



<D CO 



a* 

o i— 
O < 



o 



E E 

3 3 
O ™ 

E -2 

-° — 

cn > «o 

<y CO 

.1 £ E 
a5 co J5 
- ( ; <C c 

~* GO <1> 

£o£ 

era ° 

o f— u 
U < ™ 



3 

LU 

cr 
m 



< 
CD 



CD 

to 



O 
CD 



CN 
fM 

o 

CN 



! cn 



: co 



CO 



uo 
co 



< 

cr 



X) 

Gl ' 

< i 



xj > 
o or 

>^co ro 



I 



< 

LU 



1323 


m 

CO 


CO 


CO 


•^r 


r-» 


CD 


co 


ro 


CO 


CN 


Cm 


O 


O 


CN 


CN 


020270 


025423 



uo 

CN 



O 
CD 



co j 



BNSDOC1D: <EP 11C8790A2_L> 



145 



EP 1 108 790 A2 



10 



15 



CD 0> 

2 - 



OJ Q 

o ££ 

>> O 



20 



E 

CO 



25 



30 



35 



40 



45 



50 



55 



□O 



E 

O 

X 



cr: 



o 
o 

LU 

I 

cr 
> 

O 



c 

£ ; 







m 


i_n 




00 




CM 


CO 


o 


o 


m 


CO 


CO 


s 


o 






CM 


CM 





o 

O 
LU 

<*: 

Q_ t 

I ! 



a 0 

CO 2 



to ^ a 



o 

CM 



CD 

in 



Q- CL 
O O 



o 
a. 

I s 

5 ^ 

£■ * 

O c 

c fo 

<D ^ 

O O 

JZ JZ 
O- CL 

«/) </> 

o o 



CD 
IT) 



a. 

IE 

Q_ 
O 

E 



c 




o 




Z3 




OJ 




cy 








O 




To 




_c 




CL 




i/> 




o 




JC 




CL 




CO 






O 


"o 


l/S 




It. 


CD 


0J 


CJ 




>. 


a. 




cu 







y E T 

m ' -2 ™ 

in U 

LU ! CO CO 



CO 








O 




CU 




as 


0> 


c 


</) 




CO 


J* 


c 


o 




o 


o 




o 


o 




O- 

L0 


hot 


o 


CL 


-ph. 


hos 




CL 



5r q 



a. 



CM 

LU 

o 

CO 



o 
to 
cn 



CO 

o 
< 

CD 



a 



O 
o 

LU 

I 

a. 

CL 

O 



< 
o 
O 
x 
cr: 



CL 



5 



o 
o 

LU 



O 
CM 



CO 
CM 



cnj ; 

CO ! 



to 

CD 



SI 



CM 
OJ 

CM 



O 
CD 
CD 



CL 
O 

E 



ro 

X 
= Cl 

ro m 
00 CD 



OJ 

a; 
ro 
cu 
E 



r-. 


cn ; 


o 








u> 


! 

o 


o 


o j 




CO 



CO 

O 
< 
CO 

I 

CL 
X 



LO 
CM 
T 
O 
CM 



CO 
CM 



04 
CM 



E ; 

o 



co ! 



to 

CM 



O 

CM 



co ; 
II 



E 
a> ~> 
ro ^ 



< 
X 

I 

U_ 

Q_ 
< 

a 



CM , CM 
OJ I 04 



CD 
O 



o 

CM | 



to 

rM 



O 

CM 



CD i CD [ CO 

m , m | lo 



146 

BNSDOCID: <EP 1 108790A2_I_> 



EP 1 108 790 A2 



5 

10 

15 
20 
25 

T? 
O 
=J 
C 

o 
c^ 

30 

CJ 
-Q 

35 

I 

i 

40 


Function 


tRNA defta-2- 

isopentenylpyrophosphate 

transferase 




hypothetical protein 






hypothetical membrane protein 


hypothetical protein 


glutamate transport ATP-binding 
protein 


Neisseria! polypeptides predicted to ! 
be useful antigens for vaccines and j 
diagnostics j 


glutamate transport system 
permease protein 


glutamate transport system 
permease protein 


regulatory protein 


hypothetical protein 




biotin synthase 


pulrescine transporl ATP-binding 
protein 


hypothetical membrane protein 


Matched 
length 

(a a) 


o 
o 

CO 




to 
rr 
rr 






o 

CD 


XT 

! ' 


CM 
T 
CM 


i 

! - 


in 

CM 
CM 


CO 

Pv 

CM 


CM 


rv, 
CO 




rv. 
Ol 


CO 
CM 
CM 


CO 
CM 
CM 


Similarity 
(%) 


68.7 




75.7 






63.7 


86.4 


99.6 


73.0 


100.0 


99.6 


66.9 


71.6 




61.4 


69.5 


58.8 


Identity 
(%) 


o 

o" 




48.5 






29.0 


68.4 


99.6 


66.0 


100.0 


99.3 


m 

CO 


40.3 




33.0 


33,2 


24.6 


Homologous gene 


Escherichia coii K12 miaA 




Mycobacterium tuberculosis 
H37Rv Rv273l 






Mycobacterium tuberculosis 
H37Rv Rv2732c 


Mycobacterium leprae 
B2235_C2_195 


Corynebacterium glutamicum 
ATCC 13032 gluA 


Neisseria gonorrhoeae 


Corynebacterium glutamicum 
ATCC 13032 gluC 


CD j| 

1 s 

u <£ o 
to o ™ 

H !D tj. 


Mycobacterium leprae recX 


Mycobacterium tuberculosis 
H37Rv Rv2738c 




Bacillus sphaericus bioY 


Escherichia coli K12 potG 


Bacillus subtilis ybaF 


db Match 


— i 
o 
u 

LU 

s' 

CL 
m 




CO 

o 
to 
o 

ev- 
en 

Cl. 






pir:C70505 


LJ 
_J 
CJ 

> I 
t 

to 1 
cn 

> ; 

Cv 


— j 
CD 
or 
O 
o 

<' 

=) 
_i 
o 

"ol 


GSP:Y75358 


_j 
O 
or 
o 
o 

i 

u 

3 
_J 


o 
or 
o 
u 

1 

Q I 

_J 

O 
Cl 

to 


sp:RECX_MYCLE 


oo 
r-v. 
co 
O 
rv. 
< 

ex 




X 
CO 

u 
< 

O 

CD 

ol 

V) 


_j 
O 
o 

LU 

1 

o 

r- 

o 

CL 

bl 

1/1 


CM 
T 

Ol 

to 

LU 




81 


CO 
C 

o 


lO 
CO 


1359 


o i 

CM 
O 


1023 


CD 

to 
to 


1566 ' 
726 


Ol 
CM 


•XT 
CO 

to 


Ol 
CO 


CD 
IO 


■*r 

CO 
CM 


co 

CO 

rv. 


to 
rv. 
m 


Ol 

cn 

CO 


cn 
o 

to 


45 


Terminal 
(nl) 


xt 
co 

CO 
CM 

to 

G 

cm 


2053509 


2055761 


2054724 


2056787 


2057120 


2057355 


cn 

Ol 
TT 
O 
CO 

o 

CM 


2060196 


2062312 


2063259 


2063298 


T 
Ol 
CO 

to 

CO 

o 

CM 


2065667 


2067141 


2067866 


rr 
rv 

CO 
CO 
CD 
CM 


50 


Initial 
(nl) 


2053586 


CO 
CO 
CN 
"vj 
K't 

o 

OJ 


CO 

o 

rr 
TT 

u; 
o 

CM 


2055743 


to 
to 
rv. 
in 
u> 
o 

CM 

J- 


CO 

CO ; 
u> 

s I 


o 

CM 
XT 
O) 
ti> 
O 
Cvl 


*T 

rv. 
rv. 
Cl 

o 

CM 


T 

rr 
O 
UJ 

o 

CM 


2C61629 


T 

rsi 
to 

CJ 
OJ 


2063894 


2065627 


*r 
o 
rr 
to 

CD 
O 
CM 


2066566 


2067168 


j 

o 
to 

CO 

rv- 
to 
o 

CM 




S g « j 

<D 2 ^ ! 


5633 


T 

ri 

CO 

m 


5635 


5636 


5637 


5638 


5639 


o 
rr 

CO 

to 


rr 
CO 

to 


OJ 

"vT 
UJ 

«n 


ro ! 
CD j 


rr i 
<r | 

ID 

m : 


to 

CO 

to 


CD 

NT 

CO 

m 


5647 I 


CO 

rr 
to 
tn 


CD 

O , 
co i 


\ 


SEQ 
NO 
{ON A) 

2133 


CO 


2135 


| 2136 


ro : 

-! 


2138 


<j> 

ro 
oj 


o 

CM 


^_ t 

"vf 
CM 


»• ! 
? I 

Ol 


ro 

CM 


rr i m 'to 
■vr j "tr j rr 

CM j CM j CM 


2147 
2148 


j 

Ol 
XT 

CM 



147 

BNSDOCID: <EP 1108790A2 I > 



EP1 108 790 A2 



10 



15 



en td 
^ c ra 
a> — 



c 

5 

o 



ro 
c 



Cl 



20 



25 



30 



35 



40 



45 



1 - 
CO 



0) 
3 

.E 
c 
o 
o 



0) 



-o 




<D 




V 








-o 




C 




<u 




CD 




ro 




E 




ro 




-D 




<JJ 




O 




c 




<u 






c 


CL 


QJ 


E 


o 


o 




o 


Q_ 



lO 
CM 



o 
£ 



o> 

CD QJ 

o o 

o. a. 

<u 0) 

o o 

03 CO 



i_ cm 

2 > 

o rv; 

-O > 

o tr 
o r- 
^ ro 
2 X 



I— 
U 



o 

lO 



^ > 

-O > 

o or 
S X 





LU 




C 


c 




rotei 


prot 


cx 


c 


CD 


o 


o 
c 


ro 


ro 


=> 


w> 


O 




Q. 


0) 


til 






0) 






<D 


3 


CO 




jo 


"5 


To 



CD < 



O 
CD 



50 



55 



co 2 ^ 



2 

CL 



< 



</> CD 

o h- 

< < 



CD 

ro 

'c 
o 
E 

<u 
c 
Cl 
in 

§< 

O °- 

O CL 

a.cn 
a> -J 
-t= CO 

co Q 



o 

Ql 



o 
Cl 



Cl 



to 



o 

CL 



CD 



o 
m 

(N 



o 

00 

co — 



O 



CD 



S O z I « 
co z B <n 



ro 


CD 


CO 




ro 


CT> 


a) 


CT» 


to 


to 


o 


o 


CM 


CM 




} (N 


in 


! U"; 


CO 


I CD 


LO 


I tf) 




i 


LO 


! in 






CM 


i - 



CO 

O 
CO 



en 



LL 
< 



O 
CN 



in 

CD 



CN | 



a> 
E 

O CD 
0.0 
CD T 

^ a 

CO CO 



E 
ro 

15 
cn 

i ? 

.2 o 
ro ro 

CO 

era 

o < 



E 

il 
■ § 

ro s> 

=5 o 
cn o 

c > u 

^ QJ O 

5? 



O 
CO 



or 
o 
o 

I 

or 
O 
>- 



5 

LU 

or 

CD 

I 

CL 
< 
Q 



21 

ro c 



E 



3 1 i 



lO 
04 



CO 
CD 



o 



I C>J 



o 



O 
CO 

co ; 



cn I r-j | cvi j 



O 
CO 

o 

CM 



CM 
CD 
CD 



CD 
CN 



U0 
^" CO : 
I CO 



O ! 
CN | 



co 
O 
CM 



O 
CM 



lO 

co 

CD 



CD 
CM 



CD 
Cm 



CvJ 
CD 



uo 
CD 



vsi 

CD I 



BNSDOCID: <EP 1 108790A2_l„> 



148 



EP 1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



45 



0) -C _ 
TOO — 



CO 



c 

> 

or 
o 

— Q; 
£ ™ 

o jr 
5- c 

>s 

ca </> 

■19 



o 

CD 



CD 

to 



o 

E 
o 
X 



50 



55 



O 
O 
J— 

E< 



c c 

^ E 

o E 



< 

or 
o 
o 

u_ 
m 



o 
ro 



o 
to 

CD 



O 
>> 



CM 
CM 



£r E 

O P 
o ra 



o 

CL 



O 

CO 

E ™ 

O f- 
Q.< 

CO CO 



O 
CO 



o 

Cl 



E 



O 

o 

Cl 
ra 



co 



o cr 

>> co 
2 X 



-5 co 

ai ™ 

_Q > 

o cr 

cj r- 

>> ro 

S X 



E 



co ^_ 



CD 
CD 



a 

! LU 
CO 



O 10 



o ^- < 00 
co ^ O j 



E ^ 

Is 

ro 

x: > 
o tr 

>%ro 
5 T 



to o 

CD 00 



r- CO 

CO CD 

CO CO 

O 1 O 

cm : cm 



CD j 



CO | 



o 



CD 
CO 
O 
OJ 



o 

CD 

o 

CM 



o 

(N 
O) 
O 



CO 




"«T 


o 


O 


LO 


ro 


CO 


a» 


en 


o 


o 




CM 




«_o 


r- 


r~ 


CO 


co 


m 


UD 



CO 

r- 
Oi 



o 
ci 



•I 1 
I I 



CD 



ro i= 



c 
E 
a> 

-o _ S 

-(DC 
w c o 

c .9 c 

2 R © 

all 

3 ro E 



o 
a. 



Q.X 
<D LD 

~ a 

co co 



o 

CO 



CO 

CD 



ro 
m 



ZD 
CO 

o 
< 

CD 

I 

< 
CO 
3 



co 

CD 



ID 
CD 



CD 



LU 
cx 
Cl 

: CD 
t CO 



.r? CO 
-D 

o £T 
o 

ro 
S X 



c. 

6 i? 

CD o 
< o. 

if 

2.1 

o & 
Cl ^ 
«n < 
C »- 
to 0J 

^ rr 
<u o 
"o a. 

o. c 
qj (0 
Cl i= 



o Q 
a. p. 



c ro 
§ co 

3 CD 



> 

ro . 

O £T 

ro 
^ X 



ro 



cm . r\i 



c j co I m i ro 

■«t c\ r- ; r*- 

rv « o o> cd 

C rw cm ro 

cio o| o 

CM ' OJ j CM 1 CM 



I CD j 



CM I 



[CM | 



o 
co 

CM 



co 


to 




ur> 


tD • 


UD 


OJ 


ro; 








CO 


i 


S| 








CM 



BNSDOCID: <EP_ 1 108790A2_I_> 



149 



EP 1 108 790 A2 



10 



15 



20 



-D 

(V £ ^ 

H c ro 



1*. 



25 



30 



35 



40 



45 



50 



CD 
=3 
C 

C 

o 

o 



QJ 



O 
CD 

O 

o 

E 
o 
X 



3 



Q3 </> 
O <u 



c o 



O CD 

3 £ 



CL 



.2 CO 

a > 

-o > 

o tr 

S X 



CL 
> 

to 



E ° 
o o 

So 

J= o 

CO CO 



CO 
CO 



o 

CL 



o 

CD 



CN 



CO 
CO 



CD 

CD 



co 
O 
U 
CO 



X 
CO 

O 
X 

q: 

i 

D 
X 

u 

c 



o 



T- CO 



o 

O -Q 



CD 
oo 



CM 

t: 



CL OJ 

cn 

IS 



CO 
CD- 

o 
E T 

O r- 

o.X 

O LO 

^ CO 
CO CO 



QJ 
E ^ 



! co 
« > 

o tr 



CL 
QJ 



CL 

O 
_J 

o 

I 

(J 

I 

CL 
> 



u 

CO 



CD O 

o 



So « 

co ^ S 



CO 



CD I 



O 
CD 



=3 O 
CD CD 



LLi 
O 

CO 

I 

X 
CO 

O 



c 




0) 




c 




o 




a. 




E 




o 




o 


o 


6 


ro 


or (twi 


: regul 




aj 




L0 




t= 


CD 


o 


QJ 


O- 






QJ 


Q) 


to 




C 


E 


a 


05 


CL 




tfl 




CD 


>« 




If) 



o .£ 
o -a 

6 



E 



55 



a Q < 

UJ y 
CO 2 O 



to 
to 



CD 
CO 



O 
CD 



CD 
CN 



CD 

CM 



o 

O o 



^CO 
o ^= 
(J O 



O CN 

E < 

QJ Ct 

□ Q 



CD 
O 
m 

CO 



< 
o 
ro 
ro 



LU 
< 



CO 

tO 

CI CN 
CN 



CO I o 

I CD I CD I CD I 
i CO CO CD 



■cr i lo I co 

CD I CD CD 
CN j CN I CN 



go ; cj> I o 
CD CD ; O 
r- CN 



o 



CN I 



CN 
O 
CN 



BNSDOCID: <EP 1 108790A2J. > 



150 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



CD 
I < 



5 c 2 

CZ to 

CTJ <U « — 

2 " 



E 

CO 



50 



55 



o 



q3 



o> i to 
to i «- 



2gf 



° O S 



CD 
O 



£ 
E 



E cn 

TO 



Is X 



3.2 

CD c 
in C 



CO 



Cn m 
TO 

8 I 
g; S ° 

o n to 

CL > ^ 



p 
ci. 

CD 



CI) 




fO 




J- 




CL 




t/i 




O 




x: 




CL 




CO 




i 






o 


O 


l/l 








a? 




E 




o 


tn 


X 


O 


o 


o 


0) 




~o 


T3 











o 



o 

CM 



a. 

CO 

O 



O 

o 

LU 

I 

cr 
x 
a 



CO 

r-- 
o 



t in a> 
o I o i n 

cn » cn i 



o 
oj 



ol 



CD 

tr 
o 



o 

CD 

< 







•E 








> 








o 








a> 




tn 




TO 




>> 




3 




ro 




E 




o 








CD 






TO 
> 


E 








f\J 




£Z 


CL 


O 



to 
to 



t/i 




%n 




o 








u 




q3 




-Q 












5 












TO 




-Q 


> 


O 




u 






5 


X 



o 



CJ 
TO 



to 



LO 
CO 



O 

n 



Si > 

TO 

_Q > 

o LY 

>"> co 
2 I 



o 

CO 
CD 



CO 

in 

CN 



TO 
*/> 
O 

c 
o> 

TO -Q 
tn u 
TO CN 

£ °* 

O CD 

E m 
o 

? o 



i — | ra 
CL < ! CD 



LLf 
< 
LU 

CO 

a 
< 

CO 

Q 

O 



LO j CO 



CN J 
CNj 



OJ 



o 

CO 
CN 



CO 



E T 

O ^r- 

Q. LU 
CJ CN 

CO CO 



ao : cn 

LO j LO 



o 




o 




cr 




h- 




co 


CD 


1 

CO 


CD 

to 


h- 


cn 


LL 


CO 


UJ 


< 


cL 




tn 


O 



cn 
ro 



o 

CN 

LO 



cn : 

LO 



CD 



O I <— 

OJ I CN 

CN J CN 

OJ CN 



BNSDOCID:<EP nOB79QA2 l_> 



151 



EP 1 108 790 A2 



c 

o 



LL 



10 



15 



20 



<V XL . — . 

~ ? 2 

*^ «- CO 

ra cu — - 



CO 



25 



30 



35 



40 



C 



o 

o 



o 

CL 



O 
CL 



o 

CL 

1 

0) 

CD 



O 

to 



o 
to 







V> 




o 








U 




a> 




JO 












i 


a> 




GO 


5 


<3 



o a: 
re 

x> > 
o cr 

>>co 
^ X 



to 



o 
> 



CO 



.2 co 

f > or 

ro ^ 

> 

o tr 

o r*- 
>*co 
^ X 



c 
aj 
o 



o I 



X 

> 



is 

3 > 

a C£ 
JO > 

c tr 

o 

>> co 
2 X 



1 

CO 

X> 

o 

O 



CO 
Z3 

o 
a> 
jd 

Z3 

|5 

.3 cd 
x> > 

O EC 

a 

co 
5 X 



o co 
X X 



2 (u 

CO c> 

J= TO 

in £- 



E c 



o 

Lf) 



CM 

cm 



50 



55 



CO 
CM 



> 



o 



X 



o 
>- 



to 

CD 
CO 



3o - 

. $ * 2. 



So si 

0D Z O | 



(O 


CO 


CO 


to 


CO 


o 


CN 






"VJ 






CM 




CO 




CN 


! CM 



to 

CM 
CD 



LL) 
< 

X 

! 

CM 

X 

!Z 



to </i 



x 



m 

CM 

t: 



cm 

CM 
CM 



O- CL 

Si 

P CL 



o ^ 

■Je 



UJ 

JC 



CO 

CD 



CD 
CM 



CM 

! ™ 



ZD 
CO 
O 
< 

m 

i 

UJ 

x 



CO 

(1) 

£ o 

O T- 

<D CD 

^ O 
co to 



o 

CO 



o 
o 

UJ 

I 

to 
X 



CM 
CO 



o 

O 

LU 

s' 

X 



CO 
CM 
I CM 



co 
cm 



CO 
CM 
CM 



i ™ 



i CM 

1ZL 



BNSDOCID: <EP 1 108790A2_I_> 



152 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



t3 cn <t 

m S a. 



E ~ 

CO 



o 



E 

CJ 

X 



o 

0 c 
0) — 

t/> O 

1 ? 

a 3 
■3 J2 

§.» 



CD ^ 



45 



50 



55 



LL! 

a 
cr 

O 

CD 

I 

X 
LU 



ro 

CO 



If 

cn 

E 5 



CN 



C ~ ! cn 



i ft* 1.1 - 



13 

E 

in 

o <w 
- in 

V 53 

S.» 

to -9 

v £ 

CO o 



5 

cu 


<D 
</) 


CO 
w. 


c 




'c 


Tn 


03 


c 






cn 




< 






QJ 


or 


E 



CO 
CO 

; *— 

o 



o 
u 

LU 

I 

Q 



c 
o 



O 
< 

z 

CO 
CO 



CD 

o 
E 

O r~ 

CL. 03 
CU LL 

±z O 
00 CO 



aj i U 
> » m 
£ < 



CO 1 CO 
CJ) 'J"> 



, 

«? E 
E c ' 

2 j 
- CO | 

So 
£ « 1 

cj CJ 



^ I CO I - 



! CD ; 
I ell 



T 

O 

ro 

CD 
CM 



CD 
O 



CO 


CD 




in 


o 


m 



I I 

I CO " -r- 



o 
CO 
CN 



5" I ; 



CN , 



I o 
■ tn 



I I 



CN 

5 



si 

< i 

*=r i 
m 

CO . 



o 

CJ 
LU 

I 

> 
CO 



^ CO 



CD 
CO 



CD 


o 








in 


CM 




CN 


CN 




CM 



in « io 

m * i - 

• i 

in ' 



m co 

m u-> 

CN CM 

CN CN 



CD j 
CO 1 
CM J 
CN 



BNSDOCiD: <EP 1108790A2_L> 



153 



EP 1 108 790 A2 



5 
10 


Function 


glucan 1,4-atpha-glucosidase or 
glucoamylase S1/S2 precursor 


chromosome segregation protein 


i 

; 

CJ 

LO 

CO 

^= 
Cl 

CO 

O 

% 

>> 
o 
ro 


transcriptional reguiaior 
hypothetical membrane protein 


cation efflux system protein 
formamidopyrimidine-DNA 
glycosylase 

ribonuclease III 


. 

hypothetical protein 


hypothetical protein 
transport protein 


ABC transponer 
hypothetical protein 


15 


Matched 
length 
(aa) 


1144 


1206 


CM 
CJ) 


lo r— 

o LO 
ro CM 


CO LO * 

CO CO CM 
t— CM CM 


CO 


CO CJ> 

ro lo 

CM LO 


r- CO 
■KT CO 

lo ro 


20 


Similarly 

(%) 


CM 

CO 
XT 


72.6 


73.9 


60.0 
73.5 


CO r- LO 
CO CO cc 
r— CO r- 


62.5 


cn co 

CD LO 

Ln 


58.8 
62.6 




Identity : 

(%) 


VIZ 


48.3 


51.1 


23 9 

39.3 


46.8 
36.1 


) CO 
> LO 
CO 


O CO 
O CO 
LO CM 


266 

35.3 


25 

TJ 

CD 
3 
C 

o 
o 

30 ~ 

CD 
X> 

35 
40 


Homologous gene 


Saccharomyces cerevisiae 
S288C YIR019C stal 


Mycobacterium tuberculosis 
H37Rv Rv2922c smc 


— — . — — i 

Mycobacterium tuberculosis 
H37RvRV2922.1C 


Escherichia coli K12yfeR 

Mycobacterium leprae 
ML.CL581.28c 


Dichelobacter nodosus gep 
Escherichia coli K12mutM or 
fpg 


CO 

tn 

1 ° 
-> 3 
- o 

§ * 

— o 

/i . e co 

1 la 

^ t> q: 

i/i ro . 
=> -o > 

= o cn 

co ro 
XI 2 X 


Mycobacterium tuberculosis 
H37Rv Rv2927c 

Streptomyces verticillus 


Escherichia coli K12 cydC 
Streptomyces coelicolor A3(2) 
SC9C7.02 


db Match 


h~ 
CO 
< 
LU 
> 

1 

X 

1 * 


3 

h~ 
O 
> 

I 

CD 
CD 
O 
> 
Cl 

CO 


3 
h- 

O 
> 

I 

CL 
> 

< 

Q- 


_j 
O 
u 

UJ co 

<x Z 

LLi CM 
U. r- 
> ^ 

CL 


CO 

S 8 
1 2 

cr> _ to 


\- 
O 
> 

CO 2 
CJ) 1 
CD Li- 
en CD 
CD O 
CD >- 
.'t: Cl 

Cl. CO 


3 

S g 

2. S 

o' 9 

CD O 

O 

X CM 

Cl t 
CO o 


—i 
o 

O ^ 

LU, 1 
1 r- 

o o 
> u 

O c/) 

CL CL 

co C7> ! 




ORF 
(bp) 


702 
3393 


s s 


282 
1854 


I co «- CO 
1 uo CO oo 

j CO | CD «- 


r^- lo co 

Tj* LO 

TJ" CO CO 




•n ro 
r-- in 


s I 


1530 
1122 
441 


45 


CU t- 


2177103 
2176110 


2181880 
2179628 


2183110 


2185351 
2187129 
2187342 


2187233 
2187692 

2188313 


2189166 
2189906 


2190540 


2194694 
2198004 
2198007 


50 


Intial 
(nt) 


2176402 
2179502 


2180913 
2183092 


2183391 


D CO CJI O 
■> O CD CO 
.1 CM CM <r- 

^> to to 

3 CO CO CD 
s) CM CM j CM 


2187679 
2188306 

2189170 


2189906 
2190439 


2191328 


2T93165 
2196883 
219B447 




SEQ 
NO 
(a. a.) 


O CM 
D CO CD 

- r- r-- 
o m ! uo 


5763 
5764 


5765 
. 


5767 
5768 
"5769 


5770 
5771 

5772 


CO T 

r-- >- 
lo m 


5775 

C77C 


Jf 1 u 
"5777 

5778 

5779 


55 


SEQ 
NO 
(DNA) 


2261 
2262 


2263 
2264 


2265 


o r— 1 «o o 

o CO I CO CO 
M CM I CM CM 
M CM j CM CM 


! O : - | CM 

■ CM | CM . CM 
! CM CM i CM 


co rr 

CM CM 
CM CNj 


2275 


O ■ r- co . 0 
- ( n r- | 
m j cm cm £ 

M CM CM 1 Cl 



BNSDOCID: <EP 1108790A2„L> 



154 



EP 1 108 790 A2 



5 



10 



15 



20 



25 



35 



40 



45 



50 



55 



Function 


hypothetical protein 


J peptidase 


sucrose transport protein 






J maltodextrin phosphorylase / 
glycogen phosphorylase 


hypothetical protein 


prolipoprotein diacylgtyceryl 
transferase 


indole-3-glycerol-phosphate 
synthase / anthranilate synthase 
component II 


hypothetical membrane protein 


phosphoribosyl-AMP cyclohydrolase 


cyclase 


inositol monophosphate 
phosphatase 


phosphoribosylformimino-5* 
aminoimidazole carboxamide 
ribotide isomerase 


glutamine amidotransferase 


chloramphenicol resistance protein ' 
or transmembrane transport protein | 


Matched 
length 
(aa) 


LO 

o 


CO 

to 

CO 


CO 
CO 






CO 


m 
cn 

CN 


T 

CO 
CM 


CD 
CO 


co 
CN 
CN 


tn 

CO 


CD 

in 

OJ 


TT 
CN 


m 

"C 
OJ 


0 

CN 


CM 
O 
TT 


>. 


































i§ Co 

!* 




CO 


CD 








T 


in 




CO 


CD 


r-. 


O 


CO 




O 


CO 


CO 


LO 






r--" 
CD 


CO 
CD 


ih 
ID 


CN 
CD 


CO 

in 


CD 

r— 


r*- 
CD 


TT 

CD 


CD 


CM 
CD 


TT 

lO 


to 


































>» 


































1* 


O 


cn 










cn 




CD 




CO 


CO 


O 


CD 


O- 


CO 




CN 


C\i 
CO 


CN 






O 
ro 


CO 
CO 


CO 


oS 

CN 


cn 

CN 


CN 
LO 


r*- 
O) 


CD 


to 

CD 


CD 
CO 


in 

CN 


Homologous gone 


Thermotoga maritima MSB8 
TM0896 


Campylobacter jejuni ATCC 
43431 hipO 


| Arabidops ; s thaliana SUC1 






Thermococcus litoralis malP 


Bacillus subtilis 168 yfiE 


Staphylococcus aureus FDA 485 


Emericella nidulans trpC 


Mycobacterium tuberculosis 
H37Rv Rv1610 


Rhodobacter sphaeroides ATCC 
17023 hisl 


Corynebacterium glutamicum 
AS019hisF 


Corynebacterium glutamicum 
AS019impA 


Corynebacterium glutamicum 
AS019 hisA 


Corynebacterium glutamicum 
AS019HsH 


Streptomyces lividans 66 cmIR 


db Match 


A72322 


UJ 

—> 
< 

o 1 

CL 
X 


S38197 






.2513410A 


z> 
co 
o 
< 

£33 

1 

LU 
LU 

> 


CO 

} 

h- 
O 
_j 


UJ 
LU 

1 

O 

CL 

cc 
i— 


H70556 


X 

CO 

o 

X 

cc 

1 

ro 
CO 

X 


O 
cc 
O 
0 

i 

CO 
LO 

X 


2419176B 


! 

CD 

CO 

LO 
O 
LL 

< 


1 

CO 

in 
in 
O 

CO 

0 

LL 

< 


_j 
cc 

h- 
CO 

1 

CC 
_l 

0 




pir 


b. 


CL 








Cl 


cl 
</> 


sp- 


CX 


Cl 

CO 


ds 


t 

cx 


Cl 
01 


id 

CD 


ids 


81 


1284 


1263 


.O 
ro 
ro 


LO 
CO 


CO 
CN 


2550 


o 
o 

<y\ 


co 

CD 


o 

CO 


r- 

LO 

lO 


LO 
CO 




m 

CM 
CO 


GO 
CO 

h- 


CO 

co 

CD 


1266 j 


c 

F H 

1— 


2199758 


2201070 


2201073 


O 

in 

^7 

CO 
CM 
CN 


o> 
in 

o 

CN 
CN 


CN 
CD 
CD 

O 
CN 
CN 


2204591 


2207302 


2208367 


2209232 


2209920 


2210273 


2211051 


2211882 


2212841 


2214321 


Initial 
(nt) 


2198475 


2199808 


CO 

? 
o 

CN 


2201584 


2201869! 


•vj 

in 

XT 

o 

CN 
CN 


r> 

CD 
TT 

in 
o 

CN 


CD 
N 
CN 
CO 

o 

CN 
CN 


2209167 


2209888 


2210273 


,2211046 


2211875 


2212619 


2213273 


221558C 


So; 
m 2, 


5780 


oo 
to 


CN 
CO 

r- 

LO 


s 

ft 


5784 ! 


5785 


5736 


5787 


CO 

cc 
r-. 

LO 


m { 

£ i 

■«n I 


'5790I 


CD 

m » 


5792 


5793 


5794 


to 

CD 


SEQ 
NO 
(DNA) 


2280 


2281 


2282 i 


CO 
CO 
CN 

"I 


CO 
OJ 
CN 


2285 


2286 


2287 


2288 


2289 


2290 


CD 
CN 
OJ 


2292 


| 2293 

I 


2294 


10 : 
o> 

CM » 
O. 



BNSDOCID:<EP 11 08790 A2 I > 



155 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



T3 

<V XT 

<o o» — ' 



CO 



CO 

o 

o 2 



O Q) 



£ c 
5 E 



CD 



O 

O 



o 
o 

E 
o 

x 



40 



45 



50 



£ 
o 

cl ^ 

<D CD 
CO JZ 



O 



^ — ! CD 
E C I in 



O 
O 
cr 



to 

X 
CL 



E 
o 



CO 

o 
to 



CO 
X 

CL 



t> o 

So 



X 
CO 

X 

CL 











E 




o 




CL 




i/> 




CD 




O 




>> 




E 




o 




TO 


CO 


n 


CJ 




o 
to 


LO 


cn 
O 


c\i 


ti 


O 


x: 


CD 


cj 


CL 


CO 


CO 



I 3 



O 



CO 
CD 
O 



O m 



55 



CO z o 



to 



to 

CM 
CO 



O 
CD 
CL 
00 

CL 

CD 



O 

CM 



CD t- 



cj a 



<< 

•e * 

0 C 

CL O 

2 ■= 

P u a> 

1 ° o 

-C ■ — Q- 
U H> U) 

III 



o 

LU 
CL 



GO, 



8 > 

o a: 



o 

° 9- 

$ CT 

a o 

r- 

E ™ 
o in 
HO 

CD CN 

±= O 
CO CO 



co 



CD «- 

*- ! to 

CN I CN - 
CN CM I CM 
CM CN 



CO 



r\i | co 



O 
CO 
CL 



CL 
CD 
C 



o 

CL 



CD 

to 



O 



a 
I 



x 
5, 



LU 



CN I 



CN 
CN 



CO i 
CM < 



O 

CO 
CM 



LO 
CN 
(N 



ZD 
CO 

a 
< 

CD 

I 

a 



CM 

id 



CD 



O 

o 

LU 

I 

X 

LL 

> 
CL 



CM I CM 
CM CM 
CN ' CN 



CD \ CO 
CO i - 



CD 
CN 



CD I ' 

cm ; i 

CM 1 CM 
CM 1 CM 



CO 

CD 
CM 
CM 
CN 



CD 
O 
CO 
CM 
CM 



CO CD ! 

O O j 

CO CO 

m to 



o 

CD 



CO I 
ID I 



co ! 
to • 



BNSDOCtD: <EP 1 108790A2_I_> 



156 



EP1 108 790 A2 



5 

10 

15 
20 
25 

0) 
=> 

.£ 
c 
o 

30 t- 

0J 

35 


Function 


DNA polymerase III epsilon chain 




maltooligosyl trehalose svrthase 


hypothetical protein 








alkanal monooxygenase alpha chain 


hypothetical protein 




rnaftooligosyltrehalose 
trehalohvdroiase 


hypothetical protein 


threonine dehydratase 






0 

c 
a 
< 

E 

C 

E 
c 
re 
a 
a 

E 

u 

TO 

-O 
OJ 

cr 

0 
U 


DNA polymerase III 


1 

c 
a 

c 
a 

tn 
C 

c 
"c 
<u 

-C 

a 

E 
re 

c 
u 


histidine-bindina orotein orecursor 


1 
0 

a) 
c 
ru 

; i 

c 
c 

"to 
0 

c5 
0 

CL 

>s 
-JZ 


, Matched 
/ength 
(a.a) 


tn 
tn 
j ro 

I 




CO 


CN 
CN 
CO 








to 
ro 


0 

Oj 




CO 
CO 

in 


N" 
CN 


CO 

ro 






uo 

T 


1183 


cn 
0* 
01 


i 

! - 


co 
cn 


Similarity 
(%) 


O 
tn 




68.6 


UJ 
CN 

to 








54.4 


oj 
cn 


CN 


N~ 
OJ 


99.3 




1 


CO 

oS 


80.5 


73.8 j 


55.7 


64.7 


Identity 
(%) 


23.4 


42.0 


27.6 








20.5 


58.3 




CO 
CO 
T 


36.5 


99.3 




1 
i 

! 


22.7 


53.3 


37.6 


m 

OJ 


22.7 


Homologous gene 


Streptomyces coelicolor A3(2) 
SCI8.12 




Arlhrobacter sp. Q36 treY 


Deinoccccus radiodurans 
DR1631 


I 

1 






Photorhabdus luminescens 
ATCC 29999 luxA 


Streptomyces coelicolor A3(2) 
SC7H2.05 




Arthrobacter sp. Q36 treZ 


Bacillus subtilis 168 


Corynebacterium glutamicum 
ATCC 13032 ilvA 




! 


Catharanthus roseus metE 


Streptomyces coelicolor A3(2) 
dnaE 


Escherichia coli K12 rarD 


Campylobacter jejuni DZ72 hisJ 


^rchaeogiobus fulgidus AF2388 


40 


db Match 


i 

CN 

CD 1 

O 
CO 

c 
cn 




CD 
CO 
r- 
m 

CO 

CO 

CL 


co' 
0 
0 

CM 
O 
O 
LU 
< 

CL 
Ol 






1 


_i 
O 

X 
CL 

1 

Cl 
tn 


to 

1 

OJ 

X 
h- 
U 

CO 

d 
o> 




pir:S65770 


sp:YVYE_BACSU 


sp:THD1_CCRGL 






pir:S57636 


prf 2508371 A 


—i 

0 

LU 

I 

O 

cn 
< 
or 
0. 

cn 


LU 
"O 

< 
0 

1 

"O 

CO 
X 

CL 

i/> 


GO 
TT 

tn 

cn 

CO 

O 

CO- 






CO 


to 
0 

CD 


CO 

CO 

"7 

CN 


1023 


j 

en 1 
CT> 1 
ro 


CO cn 
cn co 


1056 


T 
O 


CO 

ro 


ro 
OJ 


1785 


to 

CO 


CO 

0 

ro 


O 

to 


to 
to 


1203 


3582 


O 
CO 


CO 
CD 
T 


CO 

cn 


45 


"ro 

.E 

E c 

QJ 
f— 


2234070 


1 2234763 


2234852 2237284 


j 2238353 


I 2238694 


J 2239845 
2240058 


| 2239508 


■^T 
OJ 

O- 

OJ 
CN 


co ■ cn 
ro | oj 

*- OJ 

? , ^ 
oj ! rj 

OJ OJ 


' 2244819 


ro 

cn 
ro 

OJ 

OJ 
OJ 


CD 
OO 
TT 

CN 
CN 


OJ 

00 

CD 
T 
OJ 
OJ 


to 

cn 

CD 
T 
OJ 
CN 


to 
0 
0 

r- 

TT 

OJ 
OJ 


CO 
tO 
ro 
CO 

T 

OJ 

Oj 


2252856 


2253659 


2254642 | 


50 


Initial 
/nt) 


CO 
CN 

cn 

CN 

ro 
CN 
CN 


2234158 


2237331 


2239092 


CN CO 

vr 

O CN 
O O 
T N" 
CN OJ 
OJ OJ 


ro ! 
to 1 
tn 1 
0 J 


CO 

to 
0 

vr 

OJ 
CN 


to 

OJ 
V 
OJ 
OJ 


cn 

U'i 

ro 

OJ 

XI 

OJ 
"M 


to 
ro 
0 
ro 

NT 

OJ 
OJ 


ro 
*r 
0 
ro 

N 
-N 
■N 


o- 
to 

N 
OJ 
OJ 


CO 
CO 
ro 
CO 
T 
OJ 
OJ 


O 

m 

*T 

ID 

'N 
"N 


00 

0 i 

OJ 
00 

-\ 

-N 


2251939 


"N 
Jj 

N 

^ I 


CN 

cn 
ro 

LO 

OJ 
CN 


LO 
"N , 




GO 2 5 


[5815 


CO 

00 

>n 


1 

*l 


5818 


cn 
»— 
CO 

JO 


O T- 

OJ OJ 
CO CO 

-0 tn 


oj ro 
oj o; 

CO ! GO 

to j to 


TT t 

OJ 

CO 

to » 


n 

-M 
30 

n 


5826 


"N 
X> 

r> 


5828 


□0 < 
"N < 
3D < 

n » 


■n < 

3D ( 

n * 


"O 
3D 

n 


O! < 

ro < 

CO ( 
to I 


■O 

0 i 


*r ! x 

"O t 
33 ' < 

n 1 


n j 
0 I 


55 


O n < 

t/j z a 


2315 


Si 
"O 


- , CO 

T> ; ro 

"N j CN < 


C7> 1 


3 j — 
"XI } OJ 

■n | ro 

"N OJ 


2322 
2323 


rr 1 
OJ { 

ro < 

OJ ( 


0 

N 

N 


2325 


■m 
0 

M 


2328 


30 , 
"N ( 
O c 

"N ( 


z> ', * 
<o , f 

n|c 


_ I 

*"> I 


2332 


•0 

0 < 

O i 

N ( 


^JJ4 

2335 1 



BNSDOCID: <EP 1 108790A2_I_> 



157 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



50 



55 




158 

BNSDOCID: <EP 1 108790A2_I_> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



O 3> rt 



o 

E 



o a? I 
ft? I 

-C CL , 



Oj CD 
CD O 



"to 



a> xr 
CO 



TO ^ . 
c — 



to 2: 



w 2 o 



CM 



> I 

CL 



to 

CO 



i E 



Of M 
*- VI 

CO 



- 3i 
c o 



II 



CD 

ai 
cn 



£2 

O c 



to I 

CD | 



LLf 

a 

CD 

I 

CO 



O 
cd 

CO 



cm 



3 



^ ? E 



TO TO £ ^ 

6 2 ex 8 £ 



E 



&3 



CD 



CD 

CO 

CD 



to 

to 
to 



o 

CO 
CM 
CN 



o 
CD 
< 



CO 

CN 
CM 



O 
CD 



CD 



— 5 

E E 

2 CD 



^ a 



CD < 



i - 

TO 5 

| v> 

E «= 

CO 

' u cn 
j TO *- 

' CD < 



CD 



to 
to 

CM 



CN 
CM 



CD 



TO 



E £ 

TO 

^ CD 
CU — 



TO *2 
-1 to 
=7 E 
CL ro 
Q 3 

3 CT 



CD 



o 

a Q 
E E 

=3 C7> 

n co 

CU CO 
o ^ 
CO *~ 

5 O 

So 

CD < 



E 
>» 



5 

CD 



co 
to 



CM 
CM 



- ! 



B cm 



to I 

in ! m ; 

cn I cn ; 

CM jCNj 



CD 



I _ 

I <P 



cm 
to 

ci 
CM 



Q Q 

e g : 

TO U 
5 | = 

E .2 - 
it) > 

T C 

75 0 _ 15 

U CM TO 
TO ■ 

^ E ~ 
D_ «J _ 

D D TO 
Z) CD TO 



cn 



o 



O 
(J 



< 
or 

5 



CM 

to 

CO 

to 



r\j . C) cd 



O 
U 

UJ 

l 

LL 

or 
=3 



CD 

to 

CD 



lO i CO 

CO to 

cn I cn 

CM , CM 



CO 

to 



cm j cm 



BNSDOCID: <EP 1 108790A2 I > 



159 



EP1 108 790 A2 



10 



15 



A3 <L> 



O Q 

if a 

% to, JS 

O CM 03 

Hi 

ID en rd 



20 



25 



30 



35 



40 



45 



50 



55 



T3 
CD 

C 



,C0 



-o 

c 



O 
O 
O 



OJ 
O 



3 

u 
< 

CD 
LU 

cr 
3 



E 



o 



Q. 



s 

"55 

cz 

-C <D 

1 £ 
i O 
O =1 

-a 



s ! 

Ql 



E 
E 



TO 



£ I 



.a 



CL 

-9 oj 



> LL 

m O 



E c 



> 
a 

CO 

O 



oj 

CM 



« c 



CO z 



o 

CO 



2 ^ 

8 > 

o cr 

m 
2 X 



cn 

OJ 
CM 



CN 



_ > 

<-> ex 

CD 

X3 > 

o cr 

o 



CO 

CM 

m 

o 



E 

<£ I" 

00 

y CP 



CM 



2 S 



E 
o 

oo E 



R 2 



cm ; 

CM | 



CD 
CM 



CD 

o 



OJ 

CM 



o 

s o 



a> 
CO 



TO 

E 



o 

Q- 
0) 



CM 
CO 



CO 



CO 



£2 I 



2 CM 
o cj 



GO 

CO 
CM 
CD 

a 



E ^ 

s > 

o cr 

o 

2 X 



O 
>. 

E 

C> 

£ TO 
CO CL 



3 O 
CM 

G) ■ 
CO 

S CM 



CD 

< 

CL 



CO 
CM 

m 
a 



co Z O 



v ' I 



ro 

CM 



oo 

CM 



CD 



CM 



CM 
CM 
O 



CM ! 

CO | 



OJ 



O 

co 



CM 1 



CM | 



160 

BNSDOCID: <EP 1108790A2J_> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



xz 

C 

c 
o 
o 



c 
u_ 



ro o> — * 



E~ 



£ 

o 
X 



45 



50 



55 



o 
ci 



E 
E 



CO 



CD 



1 = 



OJ 



cm 



> 

_Q > 

o cr 

>*CO 

2 X 



C3 



rv CL 



E 
o3 



CM 

to 

o 
co 

CM 



01 

To 
c 
o 
in 
o 

o> 

o £ 

■ii 

CO VI 
Q £ 

f I 

CO CL 



cm 

CD 



a. 



CD 
CD 



P 



g CM 

o O 
;>■. _) 
2 ^ 



CD 
O 



CD 
O 
CO 



c 

o 

a> 
c 



ro 

o cr 



o 
cn 
o 



2o»' 

C7) ^ ^ 



to 



CO 2 o , £ 



CO 

cn 
o 



co 

OJ 



oo 
CL 
c 

0J 
O 
Q 



CD O 
CO C/l 



CN 
CO 



e8 

I? 



to 
o 

= ID 

£ Is 



a> xi 



o 

cr 
O 
o 

i 

E 

CO 

o 



ro 

CM 



C 
QJ 
O 



CD CD 

co I t- 
o a> 



CO 



o 
o 



o 
o 
o 



E 



CO 



X5 ' 
CD 

U < 



tl> CM 
a> *~ 
U < 



o 

< 



CD 
CO 
CM 



CO 
CM 



CO I 



CO 
U0 



CO 



CO 
CM 



CO 
CM 



23 



™ o 



CO 
CO 
CO 



O CJ 

o in 

:>> O 

E o 

O *~ 

"MO 

CJ CD 

j= a 

CO CO 



(5 

CD 

o 

CO 



o 

CD 



£ O 

G> O 

O «« 

b» V) 

CL CO 



CM 



Is 

o o 



cr o 



o 

CM 



> 
CO 



o 
to 

CL 



cr 
O 

CO 



co 

CO 

cn 



CM 

co 

CM 



cn 
co 

CM 



CD 



o 

U3 



O 
m 

CM 
ro 
CM 



"5 J? 



X) o 

d .ir 



O J <D 

£ J 

Hi o 

•2 o o 
.cr o 

3 O 



CO 
O 
CM 



CO 



CD 

CM 



co 

CO 



CO 
CO 



QJ O 

.1? 



5 >' 

o cr i 

o r- 
>-ro 
2 X 



u 
> 



CD 



o 
o 



i - 

! O 



BNSDOCID: <EP_ 1 108790A2 l_> 



161 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



50 



55 



c 
o 



a> x: ^ 

co a> — ' 
2 - 



o 
E 



a: 

O : 



r- 

£ m 

CD 03 

V « 

g to 

E B 

ro -o 

1 E 

cn nj 



w O « i 
co ^ £ 



LO Z Q, . CN 



E 00 

X) > 

o (T 

>hco 
2 X 



I CO 
O 

a3 



O 



Cl 



CO 



5 < 

TO — 

4) ZZ 

o > 



X3 

E 
o 
E 



Cl 



<D 
00 
CD 

* is 

; §1 

i a s 

0) -D 

£ CL 
; CD tO 

! .E ° 

: T3 Dl 



CO O 
CD Z_ 

> <o 

O >- 



CM 
O 
CD 
< 



o 

CO 



< 



£ 

0) o 
™ CM 

o O 



CN 
CM 
CD 
O 



3 ID 

QJ xj 

y O 

Cl o 



LU 

Q 

LU 

CO 
LL 

I 

3 

m 

O 
(J 

CL 



CD 

o 

co 



o 

CD 

lO 



cn 
co 

o 



m | co 

m . — 



E 

CO Qi 

s S 

1 £ 



5 E 

XI CO 



CO 
T3 



CL 
O 



U 



Q) 
"O 

E 

re 
o 

CL 

I £ 



CD 



< 
O 

m 







u 








< 




re 




T3 




*3 




CL 




tfl 




CO 




C= 


< 


o 




om 


pe 


TD 


CO 


3 


co 


CD 


CO 


cn 




Q_ 





3 



o 

> 



LU 

> 



21 i co 



o 
=> 
Ql 
CL 

CL 

cn 



co O 



CM 

t 
CL 



CN 



< 

Ql 



O 
CD 
< 
CL 



CN 



CM 



<o o 



CM 



162 

BNSDOCID: <EP 1 108790A2_I_> 



EP 1 108 790 A2 



5 
10 




Function 


lipoic acid synthetase 


hypothetical membrane protein 


hypothetical membrane protein 


Uansposase (ISCg2) 




hypothetical membrane protein 




mutator mutT domain protein 


hypothetical protein 




alkanal monooxygenase alpha chain 
(bacterial luciferase alpha chain) 


protein synthesis inhibitor 
(translation initiation inhibitor) 






4-hydroxyphenylacetate permease 


transmembrane transport protein 


transmembrane transport protein 






15 




Matched 
length 
(aa) 


to 

CO 
CN 


tn 

CN 


CO 

to 
to 


5 

%r 




o- 
to 




L 

NT 


CO 
OJ 




o 

OJ 
OJ 








CO 
CO 
TT 


co 

m 


CO 


















































20 




Similari 
(%) 


70.9 


r-. 

CD 

r- 


67.8 


100.0 




o- 
co 

CD 




44.0 


65.6 




60.9 


p 

CO 






53.4 


72.8 


66.1 










Identity 


CD 
T 


45.5 


32.9 


o 

o 
o 




41.4 




31.0 


35.7 




25.0 


tq 
o 






21.9 


42.4 


314 






25 
30 
35 


Table 1 (continued) 


Homologous gene 


Pelobacter carbinoiicus GRA BD 
1 lipA 


Mycobacterium tuberculosis 
H37R\/ Rv2219 


[Escherichia coli K12yidE 


Corynebacterium glutamicum 
ATCC 13032 tnp 




Streptomyces coelicolor A3(2) 
SC5F7.04c 






Thermotoga maritima MSB8 
TM1010 




Vibrio harveyi luxA 


Thermotoga maritima MSB8 
TM0215 






Escherichia coli hpaX 


Streptomyces coeiicolor A3(2) 
SCGD3.10c 


Streptomyces coelicolor A3(2) 
SCGD3,10c 






40 




db Match 


< i 
o . 
_j i 
lu ■ 

CL 1 

< ] ! 

9r i 

CL 

*» 1 


p 
> 

1 

o 
o 

>- 

CL 

CO 


sp YIDE_ECOLI 


\ 

cn 

CO 

LL 
< 
CL 

cn 


1 


CO 

1 

r- 
U_ 
to 
CJ 
CO 

ol 
cn 






CO 

o 

CO 
OI 

r-- 

CQ 




< 

X 
CO 

> 

5' 

r 

bl 
cn 


O 

CN 

< 






X 
uo 

TT 

m 

CO 

o 

CN 
CN 

Tz 

CL 


o 

"~l 

ro 

Q 

O 
U 
to 

bl 

OJ 


o 

! 

CO 

Q 
O 

o 

CO 

ol 
cn 










§^ 


1 1 


O 
CO 


I 1617 


CO 

o 

OJ 


o 

o 

CO 


r>- 


ro 
CN 


m 
cn 


cn 
cn 

CO 


o 
o 

CO 


CD 
T 
CO 


co 
cn 
ro 


ro 
CN 


CO 
OJ 


1323 


CO 

tn 


T 

-tr 

T 


uo 
cn 


uo 
O 


45 




Terminal 
(nt) 


rr 

CO 
CO 

CO 

o, 


CO 

tn 

CN 
XT 
TT 

n 
Ol 


2346047 


cn 

CO 

Oi 
CO 

ro 

OJ 


2347804 


CO 

r*> 
o 

GO 

CO 
CN 


2350408 


2351996 


OJ 

T 

cn 
O 

tO j 

s I 


1 2351310 


2352828 


2353225 


2355398 


2355180 


2356843 


2357354 


2357707 


2357290 


2358130 


50 




CC ^ 

c 

£ "~* 


rr 
o 

CO 
CN 
rr 
co 

CM 


2343479 
r 


OJ 
TT 

T 

ro 
CN 


cn 

1! 


2347505 


CO 

O 
CO 
T 
CO 
OJ 


235062C 


2351022 


CO ! 

ro i 
CN ! 


cn 
o 
cn 

tn 
ro 
CN 


2351980 


2352833 


2355156 


o 1 

TT 

to 
to 
ro 

OJ 


2355521 


cn 
r-» 

CO 

to 

CO 
CN 


2357264 


co 

"^T 
r*. 
UO 
CO 
OI 


2357726_[ 






so 

co ^ 43- J 


5920 


5921 


5922 


CO 
CN 
O) 

to 


5924 


1 5925 


5926 


5927 


CO I 
CN j 

cn ! 

tr> 


cn 

CN 

cn 

UT. 


5930 


5931 


CN 

ro 
cn 
m\ 


5933 


ro 
cn 

LO j 


5935 


5936 


5937 


5938 | 


55 




O n <j 
L7) 2 o I 


8 1 
S | 


e 

CN | 


CN j 
CN i 
T 1 

r; i 


CO 
CN 

■c 

C>J 


CN 
CN 


to 

OJ 
XT 
OJ 


CO 
CN 

CN 


CN 
OI 


CN ' 
CN ' 


cn 

rN 

OJ 


o 
ro 
■^r 

OJ 


ro 

XT 

n 


s! 
-I 


CO 
CO 

rr 

OJ 


\r 
ro 

TT 
OJ 

! 


s ! 
» i 


2435 


ro 

CN 


CO 

ro 

T 

OJ 



BNSDOCID: <EP 1 108790A2J > 



163 



EP1 108 790 A2 



o 



10 



15 



20 



25 



ro a> 



CO 



X 

o 

o 

E 
o 



30 



35 



40 



45 



50 



C 

o 



Q3 



a 



o 

CD 

a 
o 

E 
o 
X 



ro 


QJ 


i= 


c/i 


o 


ro 


E 




E 




ro 


an 










ro 


>^ 


E 


:>> 


ro 


c 


j3 


de 


cn ro 



c 

E 
ro 

en 



o 
a. 



o 
5. 



o 

CO 



o £ 



a> LU 

CO en 



MSB8 




lima 




mari 




ro 
o 








Them 


glnA 



CO 

<f 

o 
a 



o ™ 

aTLU 
O 
CO co 



E c 



Q 

rr 

O 
o 

o 1 

2 



c to 

8 > 
o a: 

2 X 



CM 




ro 




< 








o 




o 




o 




"a> 




o 




o 




</> 
a> 


d 


o 




>- 




E < 


o 


m 


O) 


r— 
O 




O 


CO 


CO 



X 

<D 

en 

™ <D 

o ro 

II 

^"ro 



CL o 



C to 

o O 
.o ro 



o 



o 

CL 



O 

CO 



rr ;_ 



LLl 

X 



CD 



LU 

O 
CO 



55 



° O 

LU ^ 



CO 



W 2 □ w 







z 






<r 


MYCT 


< 

LO 


_HUM, 










o 




o 


o 


< 




CO 


9 


Cl 


CL 






o> 




1827 


o 

CO 


1293 



c 




a> 




"o 








Q. 






CD 


2: 


t/> 


CD 


JU 


aj 


ro 


B> 


-c 




Q_ 


ra 


SO 


□ 


n 


u 


CL 


CD 


ie- 


o 


E 




$ 


O 


o 





CD 

_Q 

— o 
C CO 

is 

XJ > 

o cr 

o r*- 
>» CO 
2 X 



E °> 
ro 

> 

o cr 

u 

^ ro 
^ X 



> 



5 2 
s > 

-O > 

o a: 
o r- 

^ X 



oi r-- 

CNJ 1 i- 



CM j CM 



. co 

. rr 

cn i cr> 

lO t LO 



GO 



CD 

rr ; rr 

rr ! rr . 
(N (N CM 



O 



o 



CM 



o < 

o 

cn "o_ 

So 
6 ° 

-It- 
i= cj 

LU I CO CO 



o 

rr 



o 
o 
cr 



< 

CL 



CD 



^ > 
_Q > 

o sr 

>- co 
2 X 



O 
>- 

I 

a 

o 



LU 

a 
cr 

CD 

J 



lO 
CD 



CT) 
CO 



in 



CO 



in 

CD 
lO 



tO 



Ilo 



164 

BNSDOCID: <EP 1 108790A2_I_> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



QJ 



QJ 



3 

CD 



e - 



T3 



£ 
o 
X 



o — 



c 

o 



_ <_> 
>^ cn 

£ ^ 

0J CO 

-t O 
CO CO 



CD 

U 
CO 



50 



55 



So 

CO 



LjOz 

CO Q 



o 

CO 



-O > 

o or 

I? 



O 



o 



co 

cn 
m 



fN 



to 
o 



in 

T 
CN 



O 
CD 



C c 

E o 

_2 cn 
cn c 

o c 

o -9 

tr c_ 

O lT 

CO < 

ra o 

~ Ol 

o ^ 

OQ co 

< -b 



o 

c 



GJ 












E 




qj 








E 




o 




CO 




>> 




in 




■c 




o 




CL 




(/) 




C 




cr 








<D 


c 


CO 


"5 


bo 


o 







to 
LU 



O 
LU 



CM 
CO 



ID 



— rv 

co ; to 

CM [ CM 



co 
CD 



C 
■o 

c 

E 
3 



o 

Q. 
ra 

Si 

QJ CO 

tr> 

y a. 



3 
CO 

o 
< 

CD 

I 

O 
to 
CD 



cn 

CO 



CN 



a 

o 

Q 

<■ 

LL 

CD 

u 



o 
co 

CD 



CO 

co 
in 



o 

CD 
O 



a> 
o 

E ™ 

o 

QJ CD 

^ o 
to to 



O 

CD 

o 

CO 



QJ 



j=; co 
i? « 
>» 

*Hj QJ 
o o 

' Q> 
2 XI 



o 



CO 
CN 



3%. 

o Ct - 
o 
o 

U CN 
O < 



o 
o 

CD 



O 



O CN 

a a> 

O T— 

c 

'55 or 
Q Q 



>< 



a 
< 



o 

o 

LU 

o' 
o 
< 



LU 
< 



o 



cn 

CD 



CO 
CO 



CO 
CO 



! ct> 

CO 



cn 



165 



BNSDOCID: <EP 1 108790A2_I_> 



EP 1 108 790 A2 



10 



15 



"en 
- c id 



a; 
a 
a. 



20 



25 



30 



35 



40 



45 



in 



c 

o 
o 



CD 



o 

X 



0) 

o 

>> oo 

e q 

o j^- 

t o 

a) to 



50 



55 



E 

*— • 



o 

GO 



< 



o 

3-55 

i o 

(/) CO 



u 

00 



to I cr> 
| co 



So|i 

W 2 S. ! 



S Si 
8 » 

-D > 

a 

2 X 



CD 
O 

CD 



-O 

o u 

X> 



E 2 
2 o 

7*"E 



CM 

xz 



" J CD 



o 
-cr 

CD 
CD 
ro 

CM 












E 




CD 




a> 




E 




CO 




E 


CO 






E 


"o 


cn 


ca 


m 


-Q 


to 


o 




o 


CN 




o 




E 



CM 

o 

< 



r- I oo 

OO ; CO 
O) O) CD 
LO ! lT) ID 



CM 



CN 
M 

< 

:> 
-z 



cn 
co 
oo 



CN 

CD 

o 



in 

CD 



CN I CN 



O 
T 

CN 



O 

CN 



CN 
CD 
O 



I CM CN 



166 

BNSDOCID: <EP 1108790A2_L> 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



CD rc 



ra 



40 



45 



to 
co 

CD 



-o 



o 
o 
£ 
o 
X 



E£ 



^ > 

m tt 

TO . 

X) > 
O C£ 

>»ro 

s x 



CM 
CO 
CD 
O 



50 



55 



or 
O 



ro 

— CN 

-£ 05 

O O 
co 

2 CD 

Q o 



o 
o 

LU 

< 

CL 



GO 2 S 



(/) 2 Q I 



to 
cn 



JO 
0) 

si 



GQ 



3 J i 



o ^ 
ci. cn 



TO 

-Q 

E 



a 



E 00 

Is 

> 

O (T 

>>ro 



c oo 

5 > 

jQ > 

o a: 

cj 

>> CO 
S X 



CD 

n 



CN j 



CD 
O 



cn 
cn 

CD 



< 



o 

CD 



E ™ 

8 > 

o r- 
>, co 
^ X 



o 

CD 



lo 
t o 



JO 

E 
a> 

E 



o 



o 


"C 

c 




TO 


CO 




O) 






cn 






CJ 






C 






o 






CJ 




cn 


3 




<u 




















cn 




<u 


C 




CL 


GJ 






cn 




o 








an 


to 


rc 




<u 








a) 




O 


cn 


CO 


c 


cn 




cn 


a> 
2 


be 


rc 
-6 



CN 
CO 



.Is 

>*ro 
S X 



LU 
Q 



CL 



Q 
CD 
CO 



O 
CO 



O 

cn 

CD 



^ CO 
-- CO 

X3 > 
O IX 

<j r-«. 
>> ro 
2 X 



o 

CN 

ro 



cn 


o 


co 


CD 


CO 




CJ) 


Ol 


ro 




oo 


O 






CN 




T 




CM 


CN 


Oi 



> 

Cl 



CD 

o 
o 

CD 



CN 



CD 
CN 



O 
cn 
O 

CN 



CL 
GO 

CD 



ra 

.O > 

o or 

>* co 
Z£ X 



Q_ 



U O 
>s CD 

E 

O 

^ o 

00 CO 



CM 



O 
U 
CO 

cn 



LO 

CO 

CM 
CM 

I™ l 



o 

CD 



cn 
o 

o 

CD 



! 5 i 

JCDJ 



If} ■ LO j 

CM j CN ; 



167 



BNSDOCID: <EP„ 



1108790A2_L> 



EP 1 108 790 A2 



5 
10 


Function 
heat shock protein dnaJ 


heat-inducible transcriptional 
repressor (groEL repressor) 


oxygen-independent 
coproporphyrinogen III oxidase 


c 

o ! 
^ I 

c ! 
a 

E 
c 

o 

CO 
CO 

Is 

cn c! 

CO Q. 




long-chain-fatty-acid-CoA ligase 
4-a!pha~g1ucanotransf erase 


ABC transporter, Hop-Resistance 
protein 


Neisseria] polypepnaes preaiaea iu 
be useful antigens for vaccines and 
diagnostics 


polypeptides predicted to De useiui 
antigens for vaccines and 
diagnostics 


peptidyl-dipeptidase 

carboxvlesterase 


glycosyl hydrolase or trehalose 
synthase 


hypothetical protein | 


15 


Matched 
length 
(aa) 

q on 


I 

CO 

cn 

1 


o 

CN 
CO 


rr 

CO 




t~ CO 
t- CO 

CD ' — 


rr 

o 

CD 


CO 
CD 


§ 1 

i 


O CO 
O) LO 
CO rr 


rr 

o 

LO 


CO 
rr 
rT 


I 

20 


Similarity 
(%) 

77 4 


79.6 


64.1 


64.9 




75.1 
55.4 


rr 
CO 


51.0 


h- 

o 

CO 

to 


co 

co cn 
CO rr 


CD 
rr 
CO 


58.8 




Identity 

(%) 

47 1 


4B.2 


33.1 


36.6 




48.0 


29.5 


44.0 


47.0 


CO i- 
O rj 

rr CN 


CN 
CO 

1 CO 


32.1 


25 


























C 

c 
o 

30 

CD 
JD 

35 
40 


Homologous gene 


Vj 

■> <f 
5 o 

5 

n cn 
3 n 

T> CO 

</> 

a> o 
u o 

e e 

!o OO 


Bacillus stearothermophilus 
hemN 


Saccharomyces cerevisiae 
YNR044W AGA1 


_j- 


Streptomyces coelicoior A3(2 
SC6G10.04 


Lactobacillus brevis plasmid 
horA 


Neisseria gonorrhoeae 


Neisseria meningitidis 


Salmonella typhimurium dcp 


Mycobacterium tuberculosis 
H37Rv Rv0126 


Mycobacterium tuberculosis 
H37Rv Rv0127 


db Match 


ZD. < 
Oi CN 
rr -rr 

CO ■ CO 

CN ! Oi 
rr rr 
OJ OI 

t i *c 


< 

CD 

LO 
CN 
CO 

CO 
CN 

a. 


\ 

CO 

< 

LU 

> 

1 

< 

< 

CL 
tn 


I 


I 

! 

o i 

5 < 

co 

o : 

CO 
CL. 

cn 


-i 

D 1 

J CvJ 
U £ 
n,' & 

d ° 
< CO 

2 < 

CL CL 

co cn 


OJ 
CO 

rr 

> 

CL 

CO 

CD 


GSP:Y74829 


> i 

s . 

25 ! 

a' ! 

O 1 
Q • 

CL 
oi 


i 

■o 

N 

o CO 
cr oo 
r> cj> 
Z3 o 
±. ^ 
< 9 

CD CX 


CO 
CO 
OO 

o 
X 

\ 




'81 


1145 

! U^l«J 


o 
cn 

CD 


CJ) 
lO 


co co 
cj> r-- 
CD | co 


1845 


2118 
1863 


LO 
LO 
OJ 


ro o 
ro co 
ro t- 


; 204 

^1)34 


1179 
1794 


1089 


45 


Terminal 

(nt) 



2422700 
2423915 


UO 
CD 
CD 

rr 

oi 
rr 
CN 


2426699 


tD 

r- a 

r— ! cc 
CD ! 
CN CN 

rr rj 

OJ CN 


rr 
CO 
' — 
CO 

CM 
rr 
OJ 


CO o 
^- N- 
-cr co 
OJ *r 

CO ro 
rr rr 
CN CN 


rr 

CD 
CO 
CO 
rr 
OJ 


m o 

CO rr 
co rr 
ro ro 
rr rr 

OJ CN 


2434573 
2434805 


2438049 
2439906 


rr 

CO i 
CO 

o 1 
rr 
rr 
CN 


50 


Initiai 
(nt) 


in r- 
rr co 

CD CD 
CO "<T 
CN OJ 
rr rr 
CN OJ 


2425954 


oo 

CO 

OJ 
rr 
CN 


2427468 


2430028 


2430296 
2432508 


2433868 


1 I — CJ) 

O T- 

OJ CD 
I rr rr 
ro co 
rr rr 
OJ OJ 


2434776 
2436838 


2436871 
2438113 


2439906 






6012 
6013 


6014 


6015 


9L09 


CO 

3 O 

D CO j 


6C19 


L/U^- 1 


6022 
firm 


6024 
6025 


6026 

C007 


6023 


55 


SEQ 

NO. 
(DNA) 


2512 j 
2513 


i j 
i ^ 

j OJ 


I « 


2516 

OC 1 7 


2518 


cn ] o 
t- . OJ 
UO i lO 
OJ OJ 


OJ 
LO 

Ol 


2522 


> 1 T LO 
J 1 OJ CN 

> LO LO 
J OJ OJ 


2526 
2527 


2528 



168 

BNSDCCID: <EP 1 108790A2„I_> 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



■o 

(D <U -5—' 

2 ~ 



I- 



3 
o 
cn 

o 

E 
o 



CL b 

o o 

to </) 



45 



50 



§? O 

"o .E 
w o 

£ >» 

TO o 

— >. 

co a 
6 8 

ro .£ 

<S E 
-O ro 



o 

CL ^ 
U ) Of 

c c 

TO o 

■o * 
o ° 

s £ 

E ~ 

ro e 

■E ^ 
ro q) 

u t 

p s 



2 Q « 



in 
o 
o 



I I 



55 



LU O 2 



CN 
CM 



2 £ Bl 



O 

o 
o 



to 

COQ 
O CN 

o 

o h- 
U < 



CO 
> 

CO 
O 

cr 
O 
o 



h 

(5 rN 

« s 

■2 £ 

GJ 

o h- 

o < 



in 

CM CN 



a 

cr 
O 
o 



cr 
co 



n I n I r> 



. o 
<d | <o 



cn 

CD 

co 

CD 



J cv m 
I n I ri I r) 

cm I cm | in 



cj 

Q..E 

I if 

O CL 



CL 

cn ez 



E E 



t; 
o 



O 

CD 
< 

to 

<u a> 

CL C 

o E 



O 
a 

LU 

I 

O 

a 
i 

o 

Cl q. 

cn | </> 



51- 



I CN 



O 
ID 



c 
o 



E 
a» 
cn 
to 

t: 

O L_ 

Cv o 
i/t 

C O 

5 s 

ro 

~ o> 
°- cr 
Q) t 

.9- £ 

•D Q. 



2 £ 
O CO 

E 10 

is 

X X 



a 
ro 

CD 



LU 
< 
X 

<■ 

CL 
CO 
X 



CO 

a 
< 

CO 

co' 

Q_ 
CL 
< 

Cl 



CO o 
O £X 



o 

a 



o 

CL 
CL 

o 



o 


a 


Cl 




Cl 


CL 


TD 


o 


CN 


CN 










O 


"6 


o 


u 


rc 


ro 


xr 


IE 




o 


a3 


QJ 


x: 




u 


a 


cn 


cn 


LU 


LU 



cr 



o 

CD 



i i I 



o 

T 



ro 

CD 



O 
CD 



O 
CO 



o 

CD 



BNSDOCID: <EP 1108790A2J_> 



169 



EP 1 108 790 A2 




170 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



,co 



3 



CD -C ^ 

§ 13 



CO 



O 



CD 



St 



E 10 

-O > 

o a: 

S X 



o 

5. 

5= 
cd 

CO 

o .E 
I o 
CO! £ 



a. I 



CD 

o 
cL 



CD 



E ™ 

o r»- 

CD CO 

i= a 

CO CO 



50 



55 



W 2 



CD 
CO 



o n <: 

CO ^ O i 



a 

CO 

o 

CO 



CM 



™ > 
-Q > 

o CC 

>» CO 
S X 



co 



o 3 



o 



« CD 
CD 

E c 

O -Q 



C 

2 « 

O 3 

° £ 
c « 

<u c 

Q.T3 
E E 
O -O 



CD 



3 
CO 

U 
< 

CD 



CO 
m 
co 



o 

LU 


< 
LU 


E 


E 


o 


o 


<_> 


o 


CO 


CD 


CO 


CO 






«/> 


to 


25 








to 


</i 


</> 




_tj 


.2 


o 




aj 


CD 


CO 


co 



3 

CO 

u 
< 

CJ 



CO 
CO 



0) CJ 

:>> o 
E co 

O CN 

S~ CJ 

Jr CJ 

CO CO 



E 2 

-O > 

o Ct 
o 

2 X 



O 
O 
CO 



I o 
; in 



— o 

E ° 

jO > 

o DC 

o 

>> ro 

S X 



CO 
O 

CD 



o 

CD 



St 2! o 

2 2-6 

I ?5 

3 o >, 
§ -o E 

«» to <u 



CD 
O 
<J 
cn 

a> cj 

E po 

O IN 

^ o 

co co 



u 
a 
co 



in 



o 

CO 



TO 

g* 

o 



CL C 

o 



X) 

• cu 

Q T3 



O 

a. 



CD 



E 

CJ 

*E 

03 

E o 
<D cn 

cu 

<rO 
c^O 
O h- 
O < 



CD 

O 
o 

<■ 

O 
tx 

CL 



Jl 

o h- 
O < 



CD 
a: 
O 
u 

<■ 

a: 

Q_ 



co o 



CO 

o 



cn 
o 
o 



o 
CO 



BNSDOCID: <EP 1108790A2 I > 



171 



EP 1 108 790 A2 



5 
10 


Function 
xanthine permease 


2,5-diketo-D-gluconic acid reductase 


SOS ribosomal protein L27 


SOS ribosomal protein L21 
ribonuclease E 


hypothetical protein 


U 
J 

D 

3" 
U 

5 
c 

CU 
m 

o ro 

CL 00 

to 

1? 


hypothetical protein 


hypothetical protein 
nucleoside diphosphate kinase 


hypothetical protein 


I hypothetical protein 


hypothetical protein 




15 


Matched 
length 
(aa) 

49? 


to 

r- 
CN 




CD 

O CO 
«- CO 


in 
cn 


CO 
CO 




co 

tt CO 


CN 
CD 


CN 


CO 




20 


Similarity 
(%) 

7(3 


81.9 


CN 


82.2 
56.5 


i 

ID 
CN 
CO 


1000 


76.9 


67.8 
89.6 


r*- 
CO 


64.3 


68.6 






Identity 


• | CN 

) *~ 

) CD 


CO 

O 
CO 


56.4 
30.1 


61.0 


99.1 


51.3 


37.8 
70.9 


34.8 


36.6 


33.9 




25 

CD 
C 

o 

30 

0) 

Xi 


Homologous gene 

~ . ..ui:i:. 4CO ~W..Y 


U 
3 < 

8 & 

- £ 

1 « 

in CU 

- c 2 

- >, CD 
O ^ O 
CO o 

3D On 


Streptomyces griseus IF013189 
rpmA 


Streptomyces griseus IF013189 
obg 

Escherichia coli K12 me 


Streptomyces coelicolor A3(2) 
SCF76.08c 


Corynebacterium glutamicum 
ATCC 31831 


Streptomyces coelicolor A3(2) 
SCF76.08C 


Streptomyces coelicolor A3(2) 
SCF76.09 

Mycobacterium smegmatis ndk 


Deinococcus radiodurans R1 
DR1844 


Mycobacterium tuberculosis 
|H37Rv Rv1883c 


Mycobacterium tuberculosis 
H37Rv Rv2446c 




35 


























40 


O 

-a 


D 

O 
< 
m 

X co 
3 co 
CD ° 

a ? 

CL 


cr 
o 
q: 

fr- 
CD 

.J 

CN 
—1 

QL 

a. 
<n 


< o 

ro CJ 

CD LU 

s £ 

CO ^ 
CN K 

XL CL 
CL tO 


co 

\ to 

Ll. 
O 
CO 

i s 


CO 

CO 
CO 

CL 


gp:SCF76_8 


1*- CO 
LC o 
O LC 

l/D < 
cL b 
i cn o 


gp:AE002024_10 


pir:H70515 


pir.E70863 


I 




Si 


1887 
621 


CO "«T 
CT> CO 
CO CN 


303 

2268 
~549 


573 1 
747 

609 


1308 


GO 

co 


j O CO 

in o 


O CN 

to 

CO CO 


in 
to 


CO 
CN 
T 




45 


Terminal 
(nt) 


cn lo 

CO to 
CD n- n 

CO 

O j o c 
m I m lt 

CN 1 CN CN 


2504265 
2503984 


2504300 
2504831 


2507710 
2508840 

2509530 


2509523 


CO 
CN 

tn 

| CN 


to cr 

CO CJ 

in if 

CN I O 


2512409 
2513144 


2513154 


2513692 




50 


Initial 
(nt) 


2499783 
2502577 


2503870 
2504247 


CN CU U" 

O Cn ^~ 
to o *■ 

o o c 
I cn m j ir 

CN CN | c 


) CO CN 

- CO O CN 

- «- O CJ) 

, r«- co co 

3 O O O 

> in in m 

vj CN CN CN 


2510630 


2511046 


251 1427 

OC. 1 0^*>f\ 


2512768 
2512803 


2513618 


2514114 


i 




SEQ 

NO 
{a a.) 


6085 
6086 


6088 
6089 


6090 

5091 
enno 


si 1 co t m 

D O) Ol CD 

J , CU CJ o 

n j co to to 


9609 


! 

I 6097 


8609 


6100 
6101 


1 - 
6102 


6103 


55 


SEQ 

NO. 
(DNA) 


m to ' r 
co co : a 

m in u 

| CN CN r 


- co cn 

□ CO CO 

i in | in 

N j CN j CN 


2590 
2591 

ocno 


m ! co in 
r> ct> cn m 
n • in in in 

N CN CN CN 


1 s 

CD 
1 ^ 
1 " 


I 2597 


2598 


n O 2Z 
T> o o 
O CO to 
N CN CN 


2602 


2603 



BNSDOCID: <EP 110879OA2J_> 



172 



EP1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



45 



50 



c 
o 



« C m 

co co *— - 



E ~ 
CO 



CO o 



X) 
T3 



O = 



1-E 



2 


5: 


co 






cn 


CO 




CD 





co 


to 




r*- 






CM 


cn 




to 


to 








m 


in 


cn 


CM 


CN 


CN 



1 2 Q «l 

' CO — -2- 



to ! 
to | CM 
m j to 



CL.iz 

So 

< -D 



0, 

OJ CD 
CL 

O CO 

cn -O 

— 3 

O to 



sr 

o 



CD 



CM | CM 



in I to 
o ; o j o 

CO j to ! O 



I W 2 O £ 



in ; co 
o i o 

CO I CO 



"is 



tO 

GO 

O 
UJ 



cn 
ro 



to 



cn 




CN 




cn 




CO 




CO 




O 




O 








< 




tO 




3 




O 




"Po 




3 




C7 




03 




cn 




3 




E 




her 


MP* 


h- 


t 



ID 

X 

I 

X 

c 

2 



o 
to 



to I to 

CN j CN 



CM 




cn 




< 








u 




o 




o 




~o> 




o 




o 




CO 




CD 




O 


CO 


CO 


£ 


o 


o 




a 


< 


QJ 






O 


CO 


CO 



TO 
O 

__ CD 
O 

£ o i c: 
-c ro a; 
Q-c j o 



IS 



O 

co 



CO 
CN 



> 

cc 

to 
fC 

go 
8.°- 



- CO 



r CN 
CO CO 



CO 



to o 

15 a- 

I c 

_ d» 

— t=> 

• c: 

tO QJ 

m a- 

J2 CD 



C3 -O 

O 3 

X CO 

? CO 

co "3 

< -Q 

O ai 

O ^ 
co 

>^ ir. 

c ^ 

o to 

u £ 

3 CO 



to 
to 



CO 

o 

CO 
CM 



£ CN 
U LO 

O 
CO CO 



ll 

o 

CO 



to 
co 
o 



to 
to 





cn 


o 




si 


i — 


CN 


CN 










to j 


tr> 


CO 


to 


CO j 


cr> 


O 








CN 


CN 


to 


to 


CO 


CO 


CN 


CN 


CN 


(N 



< 




O 




U 




0) 




ipa! 


c 

3 


x» 


X) 


ro 


3 


o 


to 


X 
O 


to 




JZ 


to 


Ql 


< 


CO 


o 


<u 


O 


to 

to 


nyl 


.2? 


o 


*tn 


o 


c 


3 


to 


to 





CN 




! 

CO 


1 

CO 


UJ 


aj 


CO 


CO 


cn 


cn 


O 


O 






LL 




< 


< 


cL 


CL 


cn 


cn 


CO 


o 


CO 


m 


CO 





BNSDOCID: <EP_ 1108790A2 »_> 



173 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



ca a> • — 
2 



CO 



§ 

cn 



(D X </) 

o rc 

— <n 



Y c a> 

CO CO "D 



O. 

12 



o 

CO 



0) x t/> 

O TO 
£ -P >> 



to 

CO 







CM 




CO 




< 








o 




o 












o 




o 




m 




a> 




o 








E 


o 


o 






i 




o 


CO 


CO 



i o ™; 



55 



CM 



CM 



a 

CO 



o 

JZ 

or 



CM 
CO 
CM 



LU 








CM 




CO 




00 




O 




t 




CM 




a. 




CO 


CO 


CO 




CO 


CD 



« S3 



5 

o -Q 



o 

<D 



o 3 

o -e 



CD 
CD 



co 

O 



CD 
O 



c 

=3 



aj I 



f CO 

.5 8 
" cr 

_Q > 

o ex 

CJ 

5k CO 

2 X 



u 

T 

CM 
CO 
00 
O 
T 
CM 



CM 



CM 
CO 



o 



-a 

CM 



Cm 

CD 



£ ■ 

o 
a 

CJ 

(SI 



1 ? 

CL Q. 



a 
O 
O 

x 
a: 

co 

r— 
< 

o 



< 

CO 



CM 

t: 

CL 



CM 
CM 

o 



cm 



in 

CM 



CO 
LO 



CO 

o 

CO 



cn 

CM 



CD 
CM 



CO 
S 



«- CM I CO 

CO Ci \ CO 

CO CO I CD 

CM CM . CM 



to 

CM 



CO 
CM 



CD 
CM 



174 

BNSDOCID: <EP 1 108790A2 J. > 



EP 1 108 



790 A2 



w 



15 



20 



25 



30 



35 



40 



45 



Q3 
£3 



-£3 tr> re 

C re 
re o — - 



§ - 
CO 



'c 


,_ 

c 


ZJ 




-Q 




3 






to 


o> 


Gj 


in 


CO 


re 


TO 




C 


«D 


CD 




CD 


5- 


>•» 




V 


O 


c 


T3 


■o 


CN 


rv 




T— 


CD 


QJ 


ni 


re 




ZJ 


O 


o 



re 

o x 

x> 

CL cl 



o 

CO 



3 

CL 
CA 

O X 
"D 

CL CL 



2 

X) 



n 



50 



55 



x: 

O QJ 

cM3 



. re 



a; 
cn c 

2?S 



b « 



o 

°£ 

CX O 

1- °- 

^ 01 

O c= 

, CX TO 

C 

re <l> 
To 

<D O 

£ 5 

E £ 

c 2 



re 

o 5T 

CL cx 



CO 



o 



re 



m 

CD 
LU 

tr 



re 

-Q 

o 

|s 

< CX 



< 

O 

o 
< 

< 
u 

Q_ 



CD 



o 
a. 

CXCN 

a 1 

QJ O 

cx 

QJ 

T3 O 
■ O 

P= s 

< CX 



o3 



< -o 



o 

<L> CL 
Z3 CX 
CO o 



0) 

o 
cx 
_cx 

O E 

QJ U 

-D O 
' CD 



< CX 



a> CL 
w cx 
CO <j 



< 

o 

u 
< 



CD 



O 

CO 
CO 



in 

CM 



LU 5rJ 
CO ^ 



o n <! 

LU § £ 
CO 2 Q , 



CD 



O 

Li. 
< 



CD 

CM 



CL 
CD 



CO 
CO 



co 
co 



00 
CD 



CO 
O 
< 

CO 

I 

O 



in » 
m 

CN» 



E - 
o to 



CO CO 



o 

CO 



o 5 

2: cx 



3 

a 
o 
2 



CL 
CO 

CL 



CD 
LO 

in 



to 
m 

CM 



o 



CD 



CD j CO 
CM CM 



Ct 
O 



m co 
to 



m m 
co j co 



BNSOOCID: <EP 1 108790A2_I_> 



175 



EP1 108 790 A2 



5 
10 


Function 






<L> 
to 
(0 

tB 

E 
o 

0) 

"55 
x: 

CL 
to 
O 

_c 

CL 

CO 

o> 
o 
"o 
j5 
no 
cn 


hypothetical protein 


hypothetical protein 


aminopeptidase N 


hypothetical protein _ 








phytoene desaturase 






phytoene dehydrogenase 


phytoene synthase 


multidrug resistance transporter 




ABC transporter ATP-binding protein 


dipeptide transport system 
permease protein 


nickel transport system permease 
protein 




15 


Matched 
length 
(aa) 






o 


CO 
CM 


cn 
cn 


o 

CO 

co 


CO 
lO 
CO 




1 




M" 
O 






GO 
CO 


o 
cn 

CM 


CN 

o 

CO 


CO 
CO 
LO 


CO 
CO 
CN 


to 

CO 


















































20 


Co 






71.4 


58.1 


CO 

o 

OO 


70.5 


58.1 








81.7 






63.8 


58.6 


47.7 




71.6 


73.6 


62.0 




cn 














































Identity 
(%) 






40.0 


26.2 


56.8 


47.5 


25.1 








61.5 






31.2 


CO 


CO 

LO 

CM 




CO 
M~ 


38.8 


CN 
CO 
CO 




25 

0) 
C 

c: 
O 

30 *- 
Q) 

35 


Homologous gene 






Staphylococcus aureus NCTC 
8325-4 lacB 


CM 

LL 
LY 

o 

CO 

D 

O 

J2 
3 

CL 

o 

TO 

CO 
lO 

O 
TO 

m 


*/i 

cn 
o 

r5 
u 
cu 

X2 

— ' O 

c to 

s > 

O QC 
XI > 

o OH 

>> CO 

2 X 


Streptomyces lividans pepN 


Borrelia burgdorferi BB0852 








Brevibacterium linens ATCC j 
9175 crtl 






Myxococcus xanthus DM 050 
carA2 


Streptomyces griseus JA3933 
CrIB 


Listeria monocytogenes lltB 




Synechococcus elongatus 


Bacillus firmusOF4 dppC 


Escherichia coli K12 nikB 
















































40 


db Match 






$ 

j— 

CO 

i 

m 

o 

o_ 

CO 


sp:YAMY BACAD 


pir A70B66 


tr 
i — 

CO 

1 

2f 
CL 

< 

CL 
CO 


pir B70206 








CO 
(O 

T— 

cn 
cn 

CO 

LL 
< 

b- 

CJ) 






> 

i 

! —> 
t— 
cr 
o 

CL 
CO 


cn 
C5 
rr 
h- 

CO 

i 

CD 
h- 

cr 
o 

id. 
cO 


ro 

I 

r- 

CN 
CO 

cn 
— > 

< 
-J 

CL 

cn 




CM 

i 

Cl 

m 

CL 

h- 
< 

O 
>- 

CO 
CL 

cn 


U- 

U 
< 
CD 

1 

O 
Q_ 

a 
a 

CL 


CO 
CO 
CO 

CO 
6- 






n 


o 

OJ 

co 


LO 

cu 
oo 




CO 
OJ 

to 


cn 
o 

CO 


2601 


1033 


1152 


CO 
CO 
CO 


CO 

tn 


r- 

CM 
co 




cc 
r- 
co 


1206 


CO 

CO 


1119 


1233 


1641 


! CN 
» CO 

! 00 


cn 

CO 

cn 


1707 


45 


Terminal 

(nt) 


2562387 


2563847 


2563932 


2564550 


2565623 


2563945 


2570293 


2570309 


2572175 


2572348 


2572351 


1 

2572807 


2573393 


2572659 


2573843 


2574780 


2575981 


2577232 


2578879 


2579769 


2530711 


50 


Initial 
(nt) 


CO 

in- 

< CNJ 

cc 

LP 
CN 


0" 

tc 

<T 
CN 
CC 
LT 
<N 


2554402 


2555245 
2566231 


2565345 


2569211 


c 
cr 

r^- 

IT 

CN 


2571510 


2572193 


2572677 


2572977 


2573770 


2573864 


2574718 


CO 

a» 
cn 
in 
r». 
tn 

CM 


2577213 


2578872 


2579760 


2580707 


2582417 




So; 
co ^ « 


- cr 

ij s 


C 
CC 

> CC 


> •«— 

) CO 

> to 


o 

cc 
cc 


6163 


tc 
tc 


LO 
CC 

CC 


6166 
R167 


CC 
CC 

cc 


6169 1 


o 
r- 

cc 


cc 


6172 


CO 

co 


6174 


m 
r-- 

CO 


6176 


6177 


CO 

I Id 


cn ; 
r- : 

CO 


55 




> ol - 

> CO CO 
) tO CO 
J CM CM 


r> 

iC 


2563 


CC 
CC 

r> 


LT 
» CC 

) cc 
j r\ 


> co i r- 

> CD tf 
) i CO CC 
J 1 CM O 


cc 
) cr 
) cr 

J O 


2669 


c 

r- 

cr 

CN 


2671 
2672 


2573 


tc 

CN 


LO j CC 

<D . CC 
CM j CN 


2677 


2678 


(2679 



176 

BNSDOCID: <EP 1 108790A2_1_> 



f 

EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



O 



CO 



rv 



E cn 



CD CN 

c i- 
O < 



CD 

o 
o 

o' 

O 

< 



o 



E 
E 



JD > 
O LT 

>*co 

2 I 



20ojg] 



38*181 



.Si > 
o 

ra ^ 

o ct 

>» CO 

^ X 



O 

" I— 

>*; ° 

0] 



t 



o 



< 



°- c £ 

S^S 1 cn 



p 
o. 
o> 
c 
r> 
c 

-C 

0l 

< 

q3 
n 
o 



O 

CO 

< 



3 < 

0) CN 



CL 
CO 

O 



LL 

< 



cn 

CO 

CD 

CN 



CO 



to 



rr j in 
co i ac 

to j cc 

CN fv,« 



Q> O 

o O 
2 S 



CD 

U 



c 
o 
o 

Q. 

TT 
o 

Q_ 
i/) 

c 

TO 



p 

"TO 



ro o 

in W) 

"I 

F ^ 
o 

"5 E 

ss 



< 

LU 
00 

a. 

<' 

cr 
x 

u 



O) 

co 



cn 

co 
to 



8 > 

-Q > 

o cr 



o 

CL 



CD <-> 
O CJ> 
>s 

E © 
o <«- 

Q.O 
CD CD 

L) 

CO CO 



CM 
CD 



CN 
cn 
tr> 
CN 



CO j co 
5 j CD 



Q 

CD 

O 
CO 



i - 



co | 



> 



^ CO 



Is 

o rr 
ro - 
x> > 
o cr 
u rv 
>•» ro 



CL 
0) 



CO 
CN 



CO 

o 



o 

CO 



CO 

o 



BNSDOCID: <EP_ 1108790A2J > 



177 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CO <D — ' 



E 
o 



c 



9-5 



_ c 

CD 



2°i 



il 

3 E 



o en 

CL CD 

m 2 



3 

! 

o 

CO 



(0 

3 E 

a tZ 

S tr 

55? 



ce: 
\ — 

CO 

I 

u_ 



E 

CL 



a> ^ o 



< £ 2 



E 

v> 
O 

£ c 

qj CD 

° o 
o a. 



J=> LLt 
ro to 

S3 
II 



o 



0) 
fO 



E 

o 



co t- 



P ! 



o 

CD 
T 

o 

CD 



a ~ < 1 115 

LU J~ ^ CO 



o 

CM 

in 

o 

CD 

CM 

O 

o 
rvj 

CD 



co 



to 



r-- 
o 

CM 



£ E 

CO "O 



< 

CO 
CO 



t: 



o in 
5 ° 



3 f— 

O ry- 



O 
CM 

co 



11 

CO o 



ro 
co 

to 

CM 
t 



CM 

cc 



O i o 



< 

CO 



CN 
CO 



o 
in 



e a> 



co | • 



E 

.5 in 

5 > 

_Q > 

o a: 

o r-~ 

2 X ' 



co 

CM 



o 



CM 
CO 



o 

r- 
CM 



co ! 



O 
O 

LU 

I 

]Z 
LY 

o 

oi_ 



co 

CD 



CO 
CM 



CM 
CO 



BNSDOCID: <EP 1108790A2J_> 



178 



EP 1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



c 
3 



■o 

°* £ — * 
5 - 



I? 3" 
I- 

CO 



50 



55 



E 

o 
X 



n. 



o 

Q- 



cn 



E » 
_o > 

o cr 
fx 



O 



2° 2 

co 2 « 



CO 3 Q OJ 



O 
O 
CM 



cn 
o 

CM 

co 

CM 



&2 
S ™ S 

o I — I 
U < 



3 
O 
CO 



o to 

; cm 



c 




o 




^0 












o> 




cu 




T3 




U 




c 




u 


c 


O) 




CL 


GJ 


to 


O 




Q. 


O 




CO 


O 




CO 




Z3 


O 
CL 


CD 
GJ 


V) 





m 
to 



c 

o 

co > 



in 

o 

T 

CN 
CD 



i ^ 
to 

CN 



cr> • o * 



CN . CN C\l . 



CN! 
CN 
<-D ' CD 



CN i cn 
CM . CN 

CN CN 



CO 

cn 
co 



LU 

o 

< 

CL 
CO 

cf> > 

r» LU 



C LU 
t/i _J 

co <C 

a: q 



CN 
CN 



*' 

V) 
_J 

o 



i co 
t co 



CN 
CO 



0) 
TO 



O 

in 



o 



O 

CN 



o 
o 



E P 
5 



! in 

-Q > 

o Ql 
>-co 

2 X 



2 | 

CL 



o 



co 

CD 
CN 

n 
o 

CN 



CN 



! ^ i 



ID 
CN 



I! 

CL ^1 
<D — 

iz a 

CO CO 



o 

CO 



<n| 



179 



BNSDOCID: <EP__1 108790A2 I > 



EP 1 108 790 A2 



o 



10 



15 



20 



TO <U 

o c 



c 

QJ 
O 
Ol 
(1) 

u 
c 



"S £ „ 

SI'S 



CD 

to 



25 



30 



35 



40 



45 



50 



"O 
CD 
Z5 
d 



O 
O 



,03 



O 

cn 
_o 
o 

£ 
o 
X 



Q) C 

5 .<? 

0) c 

>> £ 
o F 



^ CD 
O 

o 



co 
o 

LO 

< 

CO 



O 



55 



C7> 
CN 
O 



CO 
CO 



£ « 

c o 

E 

O £ 
O to 



CD 

o 

£ - 
o r>- 
cL< 

CD 

±z O 
CO to 



So " 



I cn 



° n5 

uj y ^ 

co ^ G. 



-O 

E 

<u 

E 

ro 

o 

Xj 
O 



E 
E 



x 
a. 



_q 

3 

— O 

c o 

is 
S > 

ro 

o > 
o Of 
u r- 
>*tr> 
S X 



CN 

d 



o 



CM 

o 

CM 



CO 
CM 



o 

CO 



o a: 
ro ^ 

-° £ 
o CC 
o r- 

2 X 



.2 to 

O LI 

ro t 
-O cn 

O 

CJ ID 
>^ ,- 
S CD 



ro 

> 

o 

o r» 
S X 



to 



O 
> 



o 

> 



CD 



o 



! ° 
>- 



o 



ro cn 
c cn 
O CD 

E £ 

"5 ° 

S h- 
a < 



CD 



O 

CD 
O 



! ,~ 



UJ 

< 

Hi 
00 
CL 

I 

X 

a 
z 
tr 



CN 
CD 



CN 
CD 



CN 
CD 



O 
T 

CN 



CD 




CO 


<r> 


cn 


o 


o 


CN 


LO 


U"> 


CD 


CD 


CN 


CN 


CD 








CN 


CN 


CD 


to 


CO 




T 




f>- 


i r- 


CN 


i CN 



xz 
o 



CO 

CN 



CO 
CN 
CO 



LD 



a 

o o 
-~ o 

£ CD 

3 < 

CO 

5 o 

ro 

jO > 

o tr 

a 



CD 
CN 
O 
> 



E m 

3 a. 

E ^ 

5 < 
cn Cl 

E 12 

Jj ro 

cj — 
ro 

x» cr 

cu 

c co 

^ CN 
O CN 
O CN 



CN 
CD 



O 
co 
CN 
CD 



ro 
ro 



O ; 
>- I 



o 

CD 
CD 



cn 

CD 
CO 



t- CN 

CO co 

- i Cn 

i CD 



180 

BNSDOCID: <EP 1 108790A2_I_> 



EP1 108 790 A2 



5 
10 


Function 


D-glutamate racemase 




bacterial regulatory protein, marR 
family 


hypothetical membrane protein 




endo-type 6-aminohexanoate 
oligomer hydrolase 


hypothetical protein 


hypothetical protein 




hypothetical protein ! 




ATP-dependent helicase 


hypothetical membrane protein 


hypothetical protein 


phosphoserine phosphatase 




cytochrome c oxidase chain I 




15 


Matched 
length 
(aa) 


CO 

CM 




r-- 

M" 


m 

CM 
CM 




CM 
CO 


o 
o 

CM 


in 
o 




CO 
CM 

1 




CO 


CO 
CO 


CM 
CN 
CN 


o 

CO 


! 

! « 

1 tn 

S 












































20 


Similari 

(%) 


99.3 




70.8 


69.3 




58.3 


58.5 


77.1 




80.8 




53.3 


60.1 


52.0 


61.0 




T 

N~ 






Identity 
(%) 


99.3 




44.2 


38.2 




30.2 


35.0 


57.1 




61.2 




25.2 


29.7 


39.0 


38.7 




46.8 j 




25 

o 

=3 
f 

"c 
o 
o 

30 ^_ 
cu 

35 


Homologous gene 


Corynebacterium glutamicum 
ATCC 13869 mur! 




Streptornyces coelicolor A3(2) 
SCE22.22 


Mycobacterium tuberculosis 
H37Rv Rv1337 




Flavobacterium sp. nylC 


Mycobacterium tuberculosis 
H37Rv Rv1332 


Mycobacterium tuberculosis 
H37Rv Rv1331 




Mycobacterium tuberculosis 
H37Rv Rv1330c 




Escherichia coli dinG 


Mycobacterium tuberculosis 
H37Rv Rv2560 


Streptornyces coelicolor A3{2) 
SC1B5.06c 


Escherichia coli K12 serB 




Mycobacterium tuberculosis 
H37Rv Rv3043c 




40 


db Match 


prf:2516259A 




gp:SCE22_22 


D 
\— 
O 

i 

CO 

o 
> 

CL 
l/l 




pir;A47039 


sp Y03H_MYCTU 


sp:Y03G_MYCTU 




J— 

O 1 
> 

1 

LL_ 

CO 

O 
> 
CL 

CO 




prf:1616252A 


sp:Y0A8_MYCTU 


pir:T34684 


sp.SERB_ECOLI 




pir:D45335 








CM 
LO 
CO 


CO 

ro 

CO 

i 


CM 

cn 
-cr 


■er ■ 
r-. 


1 

cn 


o 

CO 

cn 


r— 

CO 

in 


o 1 
o 

CO 


CM 
CD 


1338 


CO 
O 

CO 


o » 
nt : 
| 


891 


CO 
CN 

r«w 


1017 


19691 


CO 


CO 

o 

CO 

r 


45 


Terminal 
(nt) 


2658606 


2660131 | 


o 

CO 

CO 
CM 


2660671 


CO 

m 

CM 
CO 
CO 
CM 


CD 
CO 
CM 


2662331 | 


2662883 


2664060 


2665397 


2565992 


2667854 


2667870 


2668839 


2669557 


2672721 


2671063 


2573255 


50 


Initial 
(ntj 


r-» 
lO 
rr 
cr> 

CO 

to 

CM 


CO 

cn 
cn 

CO 
CO 
CM 


2660638 


r- 

CO 
VO 
CM 


m 

CO 

o 

CO 

o 

CM 


2662376 


2662867 


2663182 


2663437! 


o 

CO 

o 

CO 
CD 
CM 


2665687 


2666115 


2668760 


2669561 


2670573 


2671126 


2672805 


2672950 




I SEQ 
j NO 
(a a.) 


6253 


T 

CO 
CM 
CO 


tn 
in 

CM 

to : 


5256 1 


5257 1 


5258 


5259 


6260 


6261 


6262 


6263 


CO 
CM 

co ! 


6265 


6266 


f-x I: 

CO 
CN 
CC 


6268 


6269 


6270 | 


\ 


G rS % ' 
uj v-* t: • 

W 2 Q ; 


2753 


1 

SI 

cm! 


>n 1 
cn 1 
r- i 
r* | 


2755 


2757 


CD 

in 

f— 

CM 


2759 


2760 


2761 


CM 
CD 

r- 
cn 


2763 j 




m 

CO 

r- 

CM 


2766 


2767 


2768 


CD 
CO 

r- 

CN 


2770 [ 



BNSDOCID; <EP 1108790A2_L> 



181 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




BNSDOCID: <EP 1 108790A2, l_> 



182 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



c 
o 



5 0> ctJ 

« g a 

5 - 



0) 



C 
O 

o 



o 

cn 
_o 
o 
E 
o 



X) 

6 

CD 

E 



O — 




E r 



50 



55 



o 
a. 



CO 
CL 



O 



<J i- 
aj o 



re <D 
E o. 



to 
ai 



e8 

3 ^ 
|f 

cr> _m 

e r 

a> •— 

1 - Q. 

2 ° 

gf * 



c .£ 
o o 
o o 
a. cl 



O 
a 

i 

n~ 

CO 

(J 

Cl 



! ^ 

I ° 

cn 
I to 
I ™ 



co 2 ^! 



cn co ! 
I cn cd cn j 

| CN I CN ; CN ! 
CO } O CO 



CO 
O 



i< 



O I CN 

CN ; 



LO ' LO 

cn : cn 

CO , to 

CN ' CN 



! LO • tD 
o cn 
cm ; cn 
co I to 



W)2 Q 



cn • cn : cn 
| N ; n J 

! OJ I (M , IN j 



* oo 

cn cn 

cn cn 

co CO 



CO 
CO 
CO , 

cn 

CO ' 
CN i 



o 

CD 
< 



CO 



< 
o 
o 
< 
m 



CD 



i ^ 

I J CN 



E ^ 
o to 

cl CN 
0) UJ 

a 

CO CO 



UJ 

a 

CO 
CL 



CN 

Oi 
cn 
cn 
to 

CN 



o 
o 
n 

CO 



CD 



o 



CO 


O 


o 


cn 




LO 


r— 


CO 





o 



CO I TT 

co J cn 

^ t- 

CN CO 



S2 r- 
IE co 
Q.cn 
o o 
-o Cl , 

>-o 

E cn 

JS co 

-c a: 

a < 



LL 

cr 



E 

CO 

E ^ 
JP o 



cr 

CL 



CO ro 



cn 
o 
lo 



o 

CO 



o 
co 

CN 



LO 

o 

CO 



CO 

o 

CO 



o j 

CO 



LT> I LO 

o o 

CO CO 

CM CM 



183 

BNSOOCID: <EP_ 1108790A2_I_> 



EP1 108 790 A2 



10 



15 



20 



oj .c 
-era 



E ^ 
to 



25 



30 



35 



40 



-o 

0) 



cm 



C 

> <u 

t CO 

CO TO 

CO t- 

O <U 



• O 



o 
E 
o 
X 



Is 

X> > 

o a: 
o r- 

>■> CO 

^ X 



( ) 



o 
> 

CL 



a 



E ™ 



o 



o 



ao 

CM 



<J o 

co if) 

O 00 

n ^ 

< z 



O 



< 

U 

o 

5 

(X 



.=? en 
5 > 

_D > 

o a 

CD 



a) 

o u*) 
o.O 

O) CM 

^ a 
to c/) 



h- 
o 
> 
5 



50 



o , 



si: § 



55 



a ^ < 

LU O 7 

S ^ a 



o 

CO 



O 

CM 

o 

CO 



Q Q 



Ph 




a) 








ne 








neti 




bui 




TO 




OJ 
X 

o 

O 


sucD 



PE 


CJ 


< 


o 




CO 




CO 


X 


CO 


"c 






CO 


a3 




CL 


J3 


E 


ZJ 


ZJ 


CO 


ropyr 


cillus 




ro 




CD 



< 



TO 
CO 
X 

O 
o 

o 

3 
CO 



CO 

to 



co 

CD 



cm r\i 

CM I CM 







E 








N 




£Z 




QJ 




O 




O 




< 




O 


a> 


a 


CO 




ro 


ny 




cci 


ns 


=j 




CO 






=3 = ~ 



O < 



cn 

CO 
CD 



CT; 
00 



o 

CJ 



< 

o 



m 
O 

I 

CO 

cr 
2 



! CM CM I 
CO : CO I 
CM ■ CM 







^ 1 


UJ 


r- 




CM 


CM 1 


CM 


CM 




CO 


CO 1 


CO 


CO 


s 


CO 


CD = 


to 


CO 


I" 


^ 


in i 


to 


1 


1 <N 


CM 




CM 


CM 


1 0° 


co 


s i 


CO 


ao 


r 






CM 


CM 

i 



184 

BNSDOCID: <EP 1 108790A2_I. > 



EP 1 108 790 A2 



5 
10 
15 


Function 




phosphate transport system 
regulatory protein 


1 

| phosphate-specific transport 
.component 


phosphate ABC transport system 
permease protein 


phosphate ABC transport system 
permease protein 


phosphate-binding protein S-3 
precursor 


acetyitransferasc j 




hypothetical protein | 


hypothetical protein 


branched-chain amino acid 
aminotransferase 


hypothetical protein 


hypothetical protein 


5'-phospho/ibosyl-5-aminoimidazole 
synthetase 


amidophosphoribosyl transferase 


Matched 
length 

(a.a) 




CO 
CM 


LO 
LO 
CN 


CN 

cn 

CM 


LO 
CM 
CO 


CD 
CO 
CO 


LO 
CO 




M" 
CO 


LO 
CM 
CM 


cn 

LO 
CM 


CN 
LO 
CO 


CO 

in 


CO 


CM 
CO 
T 




































20 


Similar 
(%) 




81.7 


, 

82.8 


82.2 


76.5 


56.0 


60.0 




55.2 


74.2 


56.0 


79.0 


81.0 


94.2 


89.0 


25 

CU 
Z3 

c 
o 
o 

30 ^ 
CD 

35 


Identity 
(%) 




46.5 


58.8 


T 

LO 


50.2 


40.0 


34.3 




r» 
t 

CM 




28.6 


58.5 


58.6 


81.0 


70.3 


Homologous gene 




Mycobacterium tuberculcsis 
H37Rv RvOB21cphoY-2 


Pseudomonas aeruginosa pstB 


Mycobacterium tuberculosis 
H37Rv Rv0830 pstA1 


Mycobacterium tuberculosis 
H37Rv Rv0829 pstC2 


Mycobacterium tuberculosis 
H37Rv phoS2 


I Streptomyces coelicolor A3(2) 
SCD84.18C 




Bacillus subtilis 168 bmrU 


Mycobacterium tuberculosis 
H37Rv RvOB13c 


Solanum tuberosum BCAT2 


Corynebacterium 
ammoniagenes ATCC 6872 
ORF4 


Mycobacterium tuberculosis 
H37Rv Rv0810c 


Corynebacterium 
ammoniagenes ATCC 6872 
purM 


Corynebacterium 
ammoniagenes ATCC 6872 
purF 


40 


db Match 




o 

CO 

o 
r-- 

UJ 

5. 


pir:S68595 


i 

< 

CO 
CL 

CL 

cn 


T 
CO 
LO 

o 
r-- 
< 


pir H70583 


gp:SCD84_18 




CO 
CD 

t 

cr 

CD 

Cl- 
ot 


pir.E70809 


i 

CO 
T 
CO 
CO 

cn 
< 

CL 
O) 


CO 

I 

CO 

in 

CO 
O 

o 

CO 

< 

Q. 
cn 


cn 
o 

CO 

o 

r— 
CO 

Ql 


m 

I 

CO 

m 

CO 

o 
o 
CO 

< 

CX 
O) 


T 

1 

co 

LO 
CO 

o 
o 

CD 

< 

Cl 

cn 






o 

CO 


CM 
CO 


r*- 
cn 

CO 


J 921 


' 1014 


1125 


CO 
CO 


CO 
CO 


1095 


CO 
CO 


CN 

cn 


1101 


CO 
i — 
CM 


T 

o 


CM 
CO 


45 


Terminal 
(nt) 


2731424 


2733367 


2733455 


T 
CO 
CM 
-TT 
CO 
fv. 
CN 


CN 
O 

CM 
LO 
CO 

CM 


T 

O 

CO 

r- 

CM | 

i 


2737836 


| 2739553 


2739556 | 


2741356 


2741636 


2743785 


CM 
CM 
CM 
"T 
T 

r^- 

(M 


2744881 


2746083 


50 

i 
i 


I? 

£~ J 


2732230 


2732636 


j 2734351 
: 


2735184 


2736215 

| 


2737538 


2733711 


j 273B771 


2740650 


2740670 


2742577 


2742685 


2744010 


2745954 


2747564 




w ^ 2- j 


6328 


cn 

CM 
CO 

co 


6330 


6331 


6332 


6333 


M* 

CO j 

co 

CO 1 


6335 j 


,6336 


CO 1 
CO j 

co ; 


CO 
CO 

ro 

CO , 


6339 


o 
t 
ro 

to 


s i 

to ! 


cm 

T 
CO 
CO 


! 


SEG 
NO. 
(DNA) 


2823 


cn 

CM 
CO 
CM 


2930 


2331 


2832 


2332 


«r ! 
co 

CO • 

c , 


2835 


2836 | 


2837 


2838 


2839 


O 
T 

CO 

CM 


2841 


CM 1 
T 1 

co • 
<x.. j 



BNSDOCID: <EP 1 108790A2_I_> 



185 



EP 1 108 790 A2 



5 
10 




Function 


hypothetical protein 


hypothetical protein 


hypothetical membrane protein 


hypothetical protein 


5'-phosphoribosyl-N- 
formylglycinamidme synthetase 


S'-phosphoribosyl-N- 
formylglycinamidine synthetase 


hypothetical protein 


2 S 

1> c 

CL 

0) ™ 

§ s 

C JO 

2 5 
cn tu 


hypothetical protein 


C4-dicarboxylate transporter 
dipeptidyl aminopeplidase 


15 




Matched 
length 
(aa) 


CM 


LO 

T— 

CO 


CM 


CM 

n~ 


CO 
CO 


CO 
CM 
CM 




CO LO 

.0 to 
CO 


CM 


t~ cn 
tt to 


20 




E — 

co 


75.8 


94.0 


87.1 


71.0 


to 
cn 

CO 


93.3 


93.7 


77.9 
51.5 


68.7 


CD tO 

o 

oo 






Identity ! 

(%) 


57.3 


75.9 


67.7 
_ ■ — 


o 

lO 


to 


80.3 


81.0 


CM I O 
CO CO 
CM 


37.4 


49.0 
41.8 


25 
30 
35 
40 


Table 1 (continued) 


Homologous gens 


Mycobacterium tuberculosis 
H37Rv Rv0807 


Corynebacterium 
ammoniagenes ATCC 6872 
ORF2 


Corynebacterium 
ammoniagenes ATCC 6872 
ORF1 


SuUoIobus soUataricus 


Corynebacterium 
ammoniagenes ATCC 6372 
purL 


Corynebacterium 
ammoniagenes ATCC 6372 
purQ 


Corynebacterium 
ammoniagenes ATCC 6872 
purorf 


CO 
CO 
CD 
CL 

O CO 

CL = 

CT 

-o 

o ro 

o c 
o o 

o ez : 

o »- o 

5> =a i 
-J < c . 


Mycobacterium tuberculosis 
H37Rv Rv0784 


Salmonella typhimurium LT2 
dctA 

Pseudomonas sp. W024 dapbl 




db Match 


pir:H70536 


CM 

1 

CO 
LO 

CO 

o 
o 
CD 

< 

CL 

cn 


oo 

LO 
CO 

o 

CD 

CO 

< 

id. 
cn 


CM 

1 

o 

<o 

CD 
CO 

5 
CO 

1 to 

: a 

: CD «<r 


gp:AB003162_3 


CM 

CM 
CD 

CO 

o 
o 
CD 

< 

CL 

cn 


i 

CN 
CO 

CO 

o 
o 

CD 

< 

CL 

cn 


prf:2420329A 
prf 2216389A 


cn 
o 
r— 
o 
r*- 
O 

Q. 


sp:DCTA_SALTY 
prf 2408266A 






u 


UO 

r- 
co 


r- 
O 


I - 
I 2 


co 

CO 


2286 


o cn 
cm to 
to 


rO CM 

cm 

CN LO 


- CO 
^ CM 


co 

r>- go 

CM CO 


CO CC 

CO 1- 
co "~ 

f- CM 


45 




Terminal 1 
(nt) 


2747683 


2749111 


2749162 


2752103 


2750027 


2753121 
2752327 


u-> | cn 
cn t- 
cn oc 
cm <r 
lo tr 
h-- 

cm cn 


2753328 
2756739 


2757126 
2757129 


2757863 
2759532 


50 




Initial 
(nt) 


2748057 


LO 

I s 


! CN 

o 

I 0) 

cn 

! XT 


2751916 


2752312 I 

J 


2752402 
2752995 


2753237 


2753804 
2753992 


2756851 
2757815 


2759200 
Z / u l o^y 






| co Z . 


6343 


i 5 

. ro 

i to 


I 

n^4 r i 


6346 


r-- 
**r 

CO 

o 


6348; 
6349 


DJt>U 


6352 
6353 


5354 
5355 


5356 
" 5357 


55 




CO ^ f 


r co 1 

^ | CO CO 
D tM cm 


iO 
TJ" 
CO 
CM 


CO 
T 
CO 
CM 


! * 


I co ; ct> 

: -j- . 

; co ; co 

j CM ; CM 
t 


o *■ 

LO U 

CO c 
CN c 


— CM ro 

1 LO LO 

O 00 oo 
M CM CM 


2854 
2855 


2855 
2857 



BNSDOC1D: <EP 1 108790A2_L> 



186 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



E 



S5£ 
-g o F 

</> o o 
: o c n 



2 ~ 



is ^ 

CO 



CD 
CM 



o 
o 



E< 

3 f> 
CD ~ 



c o 

cT E u 

Oreo. 



LL 7- 



c A 
E c 



50 



55 




o 
CD 
< 



CO 



CD 

o 
o 

E < 

- = CD 

S> 

ra 



^ E CD 
U m a 



0 








"E 




ro 




C 




0 




_>> 




o> 




>s 




l/l 




O 




-C 




O 


<u 


XT 


to 


c 


TO 


CD 




O 


<D 


-C 




CL 


c 






Cn 





0 




0 








< 












O 




ro 




^ro 




~o 




LO 




1/) 




^3 








O 


ID 


O 


in 




CM 


3 


CD 


CO 





CD 
< 



ID 
CN 



O 

CO 

-J 

D 

CO 



CD 
O 

o 
h- 

E< 



E O 
° E 5 

(J ro a 



CD 
< 





a) 


O 


r»- 


in 


O) 


co 




CD 


CD 


r- 




CM 





CO 
CO 



o 

(D 

ro 



co 



0) 

o 

CL 



IS? 



ro 



ro 



co — <-> 

« S R 

0 ro w 
!E a> ai £ 

c O ° 

1 C .£ e 
> ra p ro 

£ c ro ~ 

8 gf g 

ro o ro 



E £ 
V "5 

II 



3 

(J 

(— ' 

CL 
Q 



CM 



c CO 
E CN 
3 — "1 

■8* 
5 E 
!> 

■g g 

T> ® 
ro <j 

^ CD < 

O IE .5 



CD 

rr 
o 
o 

«' 

o 



CD 
CD 



o 10 

CD O 
CO r- 



CM 
CN 



En 
CN* 
3 —3 

*! 

3 i 
|i 

ro "o 
.a ru 
<D xi 
c c 

Q 



or 
O 
o 

1 

D 
O 
CD 



m 
O 



88 

o co 
ro ZZ. 



CD 
CD 



CD I CD 

ro ro 

CO CD 



CD 
CD 
CO 
CD 



CD 
CO 
(D 



uo 



r>i ; cm i cm 



<D 

E 
>* 

N <l) 
C CO 

<y jo 
ro o 

X 2 

o ro 

12 



ro <i> 
ro 



Q 



t: 
< 



< 
o 



o — 

r> . co 
to to 



ro 



j r^. I 
co < 00 I 

CM 1 OJ 



BNSDOCID: <EP 1108790A2 I > 



187 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




188 

BNSDOCID: <EP 1 108790A2_I_> 



EP 1 108 790 A2 



5 
10 
15 


Function 1 


ABC transporter 


hypothetical membrane protein 


transposase (ISA0963-5) 




3-ketosteroid dehydrogenase 




lipopolysaccharide biosynthesis 
protein or oxidoreduclase or 
dehydrogenase 


dehydrogenase or myo-inositol 2- 
dehydrogenase 


shikimate transport protein 


shikimate transport protein 


transcriptional regulator 


ribosomal RNA ribose methylase or 
IRNA/rRNA methyitransferase 


cysteinyMRNA synthetase 


PTS system, en2yme II sucrose 
protein (sucrose-specific IIABC 


component) 


sucrose 6-phosphate hydrolase or 
sucrase 


glucosamine-6-phosphate 
isomerase 


N-acetylglucosamine-6-phosphate 
deacetylase 


Matched 
length 
(aa) 


ro 

CM 

rvi 


m 

CO 


CO 

o 

CO 




CO 
m 


i 
i 


O 
Ol 


CC 
OJ 


CM 

cn 

CM 


c 

CO 


CM 
CM 


CO 
CO 


rr 

CO 

-3- 


CO 
CO 
CO 


CO 

r»- 

T 


i - 
i ~ 

! 


CO 

CD 
CO 










































20 


Similar 


OJ 

ro 
CO 


CO 


W3 

cm 
m 




o 

CM 
CO 




56.4 


69.5 


67.5 


CO 

o 

CO 


55.7 


CO 


CO 
CO 
CO 


77.0 




56.9 


69.4 


CO 

o 

CD 




Identity 
(%) 


-ST 
CO 


50.0 


23.4 


- — 


csi 

CO 




34.3 


35.2 


30.5 


43.1 


32.6 


22.8 


OJ 

oi 


47.0 


35.3 


38.3 


30.2 


25 








































CO Co 

cn 5 

Table 1 (continued) 


Homologous gene 


Staphylococcus aureus 8325-4 
mreA 


Mycobacterium tuberculosis 
H37Rv Rv2060 


Archaeoglobus fulgidus 




Rhodococcus erythropolis SQ1 
kstD1 




Thermotoga maritima MSB8 
bplA 


Bacillus subtilis 168 idh or iolG 


Escherichia coli K12 shiA 


Escherichia coli K12 shiA 


Streptomyces coelicolor A3(2) 
SC5A7.19c 


Saccharomyces cerevisiae 
YOR201C PET56 


Escherichia coli K12 cysS 


Lactococcus lactis sacB 




Clostridium acetobutylicum 
ATCC 824 scrB 


Escherichia coli K12 nagB 


Vibrio fumissii SR 1514 manD 


40 


db Match 


OJ 

1 

OJ 
CO 
OJ 

ll 

< 

CL 
CO 


pir;E70507 


|pir:A69426 




CM 

«' 

CM 

cn 
CC 

cn 
o 
u_ 
< 

id. 
cn 




pir:B72359 


sp:MI2D_BACSU 


_i 
O 

CJ 
LU 

<' 

I 
CO 
CL 

in 


1j 

o 
o 

LU 

«' 

X 
CO 

CL 

</) 


1 

< 
m 

CO 

a \ 


i— 

CO 

< 

UJ 

co 1 
in 
i- 

OL 
CL 

tn 


_j 

o 
o 

UJ 

1 

o 
> 

CO 

ol 

CO 


o 
cn 

CO 

CO 

m 

o; 

CL 


T 

1 

ro 
o 
in 
o 

CM 

LL_ 

< 
CL 
CD 


-j 
O 
o 

UJ 

i 

CD 

Cl 
</> 


D 

LU 
CO 

> 

O 
< 

Cl 

CO 




ii 


O 
CJ> 
CO 


-n 
in 


1500 


o 

OJ 


1689 




CO 
CO 


in 

CO 
T 


m 
in 

CO 


CD 

CM 


U"> 
CO 


C7> 
CO 
CO 


1380 


CO 
CO 

cn 


1299 


CO 

uo 


1152 


45 


Terminal 
(nl) 


2798509 


2799391 


2801034 


2801313 


2801558 


2803250 


2804074 


2804676 


2805113 


CO 

o 

CD 
O 
CO 
CM, 


2806599 


2807426 


2808399 


■*T 
OJ 
CO 

cn 
o 
co 

OJ 


2811960 


2813279 


2814081 


50 


ro 

C — - 


2797020 


2798837 


2799535 


2801113 


2803246 


2803996 


2804691 


2805110 


2805967 


T 
CO 

o 

CO 
Ol 


2807252 


2808364 


2809778 


2811806 


2813258 


CO 

o 

CO 

CM 


2815232 






6391 


6392 


6393 


cn 

CO 
CO 


6395 


6396 


6397 


6398 


6399 


o 
o 

CD 


o 
t 

CO 


6402 


CO 

o 

CO 


*r 
o 

CO 


6405 


CO 
O 
*T 

CO 


6407 


55 


SEQ 

NO. 
. (DMA) 


2891 


2892 


2893 


cn 1 

CO 1 

oj ; 


2895 


2896 


2897 


2898 


2899 


2900 


2901 


2902 


2903J 


O 
OJ 


2905 


CO 

o 
co 

OJ 


2907 



BNSOOCID: <EP_ 1108790A2_L> 



189 



EP 1 108 790 A2 



3 



10 



15 



20 



25 



30 



35 



40 



45 



50 



■D 

a> x: ^ 
To £ ^ 



TO „ 



GO 



-a 



c 

o 
o 



.a: 



o 
cn 

o 
o 

E 

o 
I 



O) 
o o 
o O 
^ CM 

E o 
o 

CD CD 

^ O 

a) co 



O- 
CD 



3 

CL 

« I 

o o 

s> - 

C O) 

CO C 

GJ C 
— -? 

"o a> 

2 E 



CO 

o E 
q. a) 

c o 
2 

•O to 
OJ 

a. c 

.9- ai ! 
-a cl 



D) O 



o 5 
cn c 

"6 a. 



E lu 



175 oo 
o 05 



O 

o 

I — 
GO 



cn 
o 

CD 



< 

CN 



> < 

o c 

s| 

g R 
o O 

I" 



ro 



£ 

X) 
O 



< 



cn 

CO 



CO 

ao 



c >> 

Q> LU 
O 

e r 

o o 



c Si 2 
" o ra 

<1> >r O 

ill 



3 | 

=3 P 



CO 



> 

O 



55 



CO 21 





O 


CO 


m 




CO 


00 


CO 






CD 


CO 


CM 


CXI 



Li. 
< 



CD 
ZD 



uu u s 
oo 2 a 



CD 
O 

cn 
cm 



CN 



ro 






co 


co 


LO 


CO 


cn 






CO 


CO 


CM 


CN 




CN 








~3 


to 


to 




CM 








cn 


CM 





LO 
CO 
CM 

o 



CM 

5 



< 

CD 

I 

CD 

CL 
CL 
O 



co 

CM 



D 
CO 

o 
< 
cn 

o' 

Cl 

a. 

O 

CL 



CO 
CD 

o 



i 5 



5 

(J 

5 



CL 

CL 

o 

CL 



CD 
CN 



o 
o 

LU 



X 



O 



a) 
o 

CL 



CM 



o 



O 
CL 



C 

ro 



cn 


t 


CO 


in 


CD 


CO 


if) 







|£ 

.2 LO 

a » 

X) > 

o cc 

^ CO 

S X 



0) 
X) 

u 

Is 
*5 

ro . 

X) > 

o or 

u r- 

CO 

S X 



CD 

*- cj 

.is 

55 
£ a 

ro ^ 
X) > 
O C£ 

o r- 

^ CO 

2 X 



co I -3- 



cn 

CM 



cn 

CD 



cn 

CM 



i ^ 
i 



s 

o 



o 

co 



ro 
O 
CO 

o 
X 



co 

CM 
CD 



ao 

CM 



cn 

CM 



O i 
CM 

co j 



t- 1 CM 



CM 
CD 



- | CM 



t ^ 



190 

BNSDOCID: <EP 1 108790A2_L> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



0) 



"O 

o x: 

ro tu -— 
5 ~ 



to 



-D 



2 



50 



St 



0) 








c 




o 




CL 




*/) 




CJ 








E 




o 




to 




>v 




VI 








c 




0) 




c 




o 




CL 




E 


5 


CO 


ro 




=3 


6 


m 




CD 



I § 

§1 

CL 



o 

c 



£f < 

= E 
I- 

a > 
8 » 

-Q > 

O (T 
S X 



o .£j 

o -o 



LO 
0J 



O 

o 



to 

LLI 
< 

CD 

CL 



o 
o 



55 




ID N 

I cm cm 

I *T ! TT 

- CO 



CN 

CD 
CN 



I CD 

I ™ 

O 



o 

CL 



CO 
CN 



5 > 



2 x 



o « 



CO 

ro o 

t| 

on 

ro "O 

§1 

£ IS 
£^ 

3 CD 

o> CO 

«/J CO 

CL CO 



3 
Ql 
CL 



O 

to 
ro 



cn 
to 



CN * 

cn t 

CM , 



cn 

CM 

cn 

CM 



2 * 

JD 

.2 cu 
o 

■si 

O _C 

p "cD 



C 

'c 



3 



! CO 



T3 



£ 
ro 



CO 



r 
ro 
</> 
o 

£ 3 

o £ 

CL o 

£ f3 

— - r-» 

CO co 



< 



j lo 



co I 



CM 



t- I CM CM 

CD CO CD 

CM • CM • CN 



CO \ > 



to j co 



2t > 

£ a 

JD > 

o cr 

o r- 

2 X 



o 
m 



O CM 
C CN 
t CM 

-S ^ 
"O CM 

CL O 



CO 
CO 

CO 
CM 
> 

CL 
GO 

O 



cr> I 
ro | 



CM 
"3" 



LO 



o 

LO 



cn 

o 
ro 
cn 
ro 

c: 

o o 
£ ^ 

O to 
T3 CM 
^ LL 

a O 



cn 

CM 

> 
n.' 
co 
O 



CO 



BNSDOCID: <EP 1 108790A2J > 



191 



EP1 108 790 A2 



10 



15 



(0 0)-— 



co 

-2 
co 

Q. 

CO <D 

o co 

_C CO 

o. 



o 
u 

TO 

CD 

u 
c 

QJ 



20 



E — 



25 



30 



35 



40 



45 



50 



03 



8 



■Si 

o 
a. 
cd 
c 
-a 

a < 



c 

CO 

o 
c . 
QJ ' 
T3 



CO 

n 

CL 
Ui 
O 

O- 

o co 
£ c 

O 



CD 



55 ^ 



CD 

5- 

o 
o 
c 
o 

E 

CM 

o 

c 

0) 



(0 

CO 

o 

CD 

aj 

f0 

CO 
CO 

go 

E ^ 
o un 
-o CN 

s& 

So 



DQ 
O 
CD 

E 

CO 
CD 



to 
CO 

O 

ro 
CD 



c 



Q- 

a> 



CL 

E 



>■ 

CL 
CO 

O 



CO 

o 
< 

CD 

I 

CD 

o 

LL) 



CD 



co 
m 
ro 
o 
CD 
< 



TO 

E — - 

E c 

- — 



CO 

o o 
p to 



CO i 

O 



CD 



55 



2g: 

co 2 <o ; 



2q| s 

co ^ O I oj 



ZD 

o 

lY 



c 

QJ 
O 



O 
CL 



or 





CM 


cn 




ro 


o 




r*- 


r- 




ro 


m 


LO 


in 


CO 


CO 


CD 


0Q 


CM 


CM 


CM 



CO 
QJ 

c 

c? CD 
o c 



to 




cn 




O 




13 








a3 
















E 


r- 








CO 

> 


Tj 




ro 


-Q 


> 


o 


CC 



E 
E 



o 

CL 



CL 
. O 
tD ±= 
) XJ <y 

ef S 

• E » 
o >> o 

C X -C 

poo. 

§■6 2 

cm r a 





CD 




CD 


O 




CM 


CO 


LO 


CO 


CD 


CO 


cm 


CM 


CO 


1 ^ 




1 ^ 



:>■» co 

2 x 



o 

CD 
O 

r- 

O 



co 



CO ! CO o 



LO 


CO • 


f- 




T • 




CD 


CD 1 


CJ) 


CM 


CM 1 





CO I CD 

I f ] ^ 
i cn oo 

| CJ i CM 



- ! 00 



o 
cn 

CO 

co 

CM 



o 

CD 
CO 

1 r\i 



CO 
3 

i s 

c °" 

flS CM 

-j- ro 
ra „ 

CJ 

co 
c^O 

o I— 

a < 



< 

O 
o 



- ° 

aj co 

™ CM 



0) 

3 

O 
QJ 



« O 
>. *- 
-C ^r- 



E 

_3 

cu 

CO : 

°\ 

O 1 



CD 

O 



cn 

CM 



co 
oo 

CM 



I LO 



X 

LU 



CL 
Q- 
X 



CD ro 
ro 



CD 



LO I CD 

CD i CD ! 

CO I CO I 

CM i CM 



CD ! CD 



LO I 
CD , 
CM 



CM ! 



CD 

to 

CO 



-I 



BNSDOCID: <EP 1 108790A2_I_> 



192 



EP 1 108 790 A2 















CJ 

c 




























5 

id 
1 u 


Function 


GTP cyclohydrolase I 




cell division protein FtsH 


hypoxanthine 
phosphoribosyltransferase 


cell cycle protein MesJ or cytosi 
deaminase-related protein 


0) 

C OJ 

to 
re -2 
ro ^ 

O 0J 

_i. °- 
>» 

CO 0 
-— JD 
CO k_ 

6 S 


inorganic pyrophosphatase 




spermidine synthase 


hypothetical membrane protein 


hypothetical protein 


hypothetical piotein 


hypothetical protein 


PTS system, beta-glucosides- 
permease II ABC component 




ferredoxin reductase 


hypothetical protein 


bacterial regulatory protein, mai 
family 


15 


Matched 
length 
(a a) 


co 
CO 




CM 
CO 


LO 
CO 


0 

CO 


CD 


in 




r- 
0 
in 


CM 
CO 




CO 


CM 
O 
CM 


00 

CO 






r*- 
cn 


in 

CO 












































CD 


CM 




O 


0 


CO 




CD 




r»- 


T 


CM 




CO 


to 




CO 


CM 


CO 


20 


Simil 


CD 
CO 




CD 

to 


CO 
CO 


co" 

CO 


in 


CO 

r«- 




0 
00 


CO 
CO 


ro 
CD 


0 
to 


CM 

r» 


CD 

m 




cn 

CO 


co 
r- 


CO 

m 






CD 




O 


cn 


0 


CM 


r- 




O 


CO 


OO 




CO 


ro 




0 




r-. 




IS 


d 

CD 




CD 
LO 


to 




r-" 

CM 


O 

T 




to 
in 


CO 
CO 


CD 
CO 


to" 
ro 


VT 


O 

CO 




co 
ro 


to 


CO 
CM 


25 

OJ 

ZD 

c 
c 

8 

30 ^ 

CD 

35 


Homologous gene 


Bacillus subtilis 168 mtrA 






Salmonella typhimuriurn GP660 
hprt 


Mycobacterium tuberculosis 
H37Rv Rv3625c 


Actinomadura sp. R39 dac 


Escherichia coli K12 ppa 




Mycobacterium tuberculosis 
H37Rv speE 


Mycobacterium tuberculosis 
H37Rv Rv/2600 


Mycobacterium tuberculosis 
H37Rv Rv2599 


Mycobacterium tuberculosis 
H37Rv Rv2598 


Mycobacterium tuberculosis 
H37Rv Rv2597 


Bacillus subtilis 168 bglP 




Nocardioides sp. KP7 phdD 


Streptomyces coelicolor A3(2) 
SCH69.09C 


Burkholderia pseudomallei ORF 
E 


40 


db Match 


GCH1_BACSU 






CO 
CD 
CO 
O 
O 
LL 
< 


YZC5_MYCTU 


cx 

CO 

t— 
0 
< 

1 

0 
< 
a 


IPYR_ECOLI 




H70886 


Y0B1_MYCTU 


Z> 

r- 
O 
> 

1 

CN 
CO 
O 
> 


h- 
0 
> 

l 

CO 

CO 
O 
> 


z> 

r- 
O 
> 

1 

co 
0 
> 


3 

CO 

0 
< 

00 

«■ 

CD 
r- 
CL 




0 

0 

cn 
< 


6 69HOS 


2516298U 






d5 






Li- 
en 


sp: 


ds 


sp- 




pir 


ds 


:ds 


ds 


ds 


ds 




CL 


CL 
CD 


CL 




ORF 

( 3 P/ 


CO 
CO 
LO 


LO 

O) 


2580 


CM 
CO 
LO 


cn 

CO 


1233 


T 


CD 
CM 


1539 


CTi 

cn 
ro 




co 
cn 


cn 
O 

CO 


rr 

CM 


CO 
CM 


1233 


CO 

CO 
CM 


rr 
rr 


45 


Terminal 
(nt) 


2866586 


2668385 


2867169 


2869863 


2870499 


2B71445 


2873399 


2873393 


2873905 


2875434 


2875870 


2876280 


2876777 


2877455 


2877595 


2878478 


2880252 


2880987 


50 


Initial 


2867173 


2867471 


□0 

r^. 
0 

CD 
CO 
CM 


■c 
0 

co 

CM 


2871389 


2872677 


to 

CM 
OI 
IN 

r— 

CO 
CM 


2873611 


CO 
vr 

in 
r- 
co 

CM 


2875832 


2876280 


2876777 


2877385 


2877703 


2877858 


2879710 


2879965 


2880544 




So « 

CO ^ ™ 


CM 
CO 
T 
CO 


co 

CD 
CO 


CD 
T 
CO 


LCI 

co 

CO 


CO 

to 

T 

to 


6467 


CO 
CO 
rr 

CO 


6469 


6470 


6471 


6472 


5473 


6474 


6475 


CD 
f-- 
T 
CO 


tn 


6478 


CD 

r- 
CO 


55 


j co Q 


2962 


2963 ! 


2964 


in 

CO 

0 


2966 


2967 


2968 


2969 


2970 


2971 


2972 


2973 


TT 

r- 

CT) 
CM 


2975 


2976 


2977 


2978 


2979 



BNSDOCID: cEP 1 1C8790A2_I_> 



193 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




BNSDOCID: <EP 1 108790A2_L> 



194 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



c 

LL 



T3 



2 C ra- 
re £ ^ 



CO 



O 
cn 
a 
o 

E 
o 
X 



^: 

o 2?< 

CD ^= i 
"C - 
o "O < 

CL C « 
~ <tJ . 

m 0) ' 

+ c . 
X <5 



^ cu 



^ 5 _cu 



3 ^ 

E g, 
5* 

O TJ 
CL C 
~ CO 

X ro — 

+ 

TO </» O 

2 2 5. 



TO 

CU 

CU c 
Q. O 

1 2 

E g, 
o £ 

O "O 
CL C 

to 

r 2^ 

X m — 
^ « a) 

TO ^ 2 

2 £! cl 



o o 

.9- '5 

3 ^ 

EOT 

5 X 

E °- 

CU "O 

to c 

>* ro 

CO Q> 

x o u_ 

3 c 

f is - 
« -2 

. \n O 

* 2? Q. 



< 

CL 



o 


Q 


LLf 


Ol 


CL 


Q. 


E 


mr 


mr 






T 


LL 


LL 


U_ 


O 


o 


O 


tn 


*/> 


to 


E 


3 

E 


=1 

E 






«*s 


to 


to 


tn 


3 




_2 


o 


1 


u 


CO 




CO 


m 




CQ 



55 



ro 1 ~. 



2 9 «i 

co 2 S. 



lu 2 5 

CO ^ O 



O 

o 







AF097740. 


AF097740, 


CL 

cn 




a. 


1668 





E S, 



x 

CL 
T3 



2 * 



CM 
CM 



cn 

CM 



CM 
O 
LO 
CD 



CL 

To 



aJ > 

u .9- 
— 

-O > 

o or 

>» m 
2 X 



E g 

CO c 

*- 

P c 

< o 

2: o) 



5.1 



O 



CO 

co 



o 
o 
ro 



CM 
O 
O 



CM 



00 

o 
< 

CQ 

1 

i&j 

I Q 
a. 



u or 
fo 

x» > 

o or 

IS 



q 

Ol 



5 ? 

-Q > 
O QT 
a 

2 X 



CD ; ro 

— cn 

QO J CM 
CO O 

— CM 

en ! cn 



I 7- 

j c> 

CM 



CD 



CU 

c 
o 

E < 

CO £ 

CO S< 



< 

CL 



a> i cn 

CM . CM 



CM J 



I CO 



to r- co : 

o 1 o o I 

in in u) i 

to 1 co - to 



o 
o 

ro 



o 
o 

ro 



I ^ I CM 

! m I \n 

O j (O 



to 



BNSDOC1D: <EP 11C8790A2 l_> 



195 



EP 1 108 790 A2 



5 
10 


Function 


membrane transport protein or 
bicyclomycin resistance protein 

sodium dependent phosphate pump 


phenazine biosynthesis protein 


ABC transporter 


ABC transporter ATP-binding protein 


mutator mutT protein 


hypothetical membrane protein 


glutamine-binding protein precursor 


serine/threonine kinase 


t 

i 


ferredoxin/ferredoxin-NAUK 
reductase 

acetyltransferase (GNAT) family 


OJ 

E 

CO 

c 

o 

CD tO 

S 5 

- C C 
o CO 
<C 

CL~>* 

S | 






15 


Matched 
lenath 
(aa) 


CO CN 
CD CO 

co cn 


CJ) 
CO 
CM 


to 
in 

CM 


CD 
O 
CO 


CO 
CD 


CO 
CN 


o 

CN 


m 
o 

CO 




r- co 
in uo 


CD 
CO 






20 


Similarity 
(%) 


67.2 


56.4 


60.8 


66.3 


68.5 


70.2 


64.8 


m 
co 

CD 




67.8 
60.3 


oo c 

82.0 








Identity 

(%) 


31.6 

OR * 


38.8 


CO 

T 

CN 


36.9 

i 


CD 
T 


35.0 


31.5 


41.2 




37.2 
34.0 


59.1 






25 

0) 
=3 
C 

1 

o 

30 r 

CD 

35 
40 


Homologous gene 


Escherichia coli K12 bcr 


o 

CO 

L s 
3 « 

o 

- <D 

? 1 

11 CO 

u c 

5 s 
§ 3 °- 

> 0_ CO 


Streptomyces coelicolor A3(2) 
SCE8.16C 


Bacillus licheniformis ATCC 
9945A bcrA 


Mycobacterium tuberculosis 
H37Rv Rv0413 


Mycobacterium tuberculosis 
H37Rv Rv0412c 


Bacillus stearothermophilus 
NUB36 glnH 


Mycobacterium tuberculosis 
H37Rv Rv0410c pknG 




Bos taurus 

Escherichia coli K12 elaA 


Bacillus subtilis 168 pur 






db Match 


—i 

o 

u 

LU 

I 

QL 

O 

cn 
o_ 

CO 


t < 
30 UI 

^ o 

< INI 

o X 

> CL j 
Cl cl. 
C7> j CO | 


CO 

t 

CO 
LU 

O 
LO 
Cl 

CJ) 


O 
< 

CO 

«' 

Ql 
(J 
CP 

CL 
trt 


pir C70629 


pir:B70629 


h- 
CO 

o 
< 
m 

i 

X 

-z. 
_J 
CD 

Cl 
if 


pir:H70628 




■z. - 

§ a 

o <' 

(X < 
Q —3 

< LU 
Cl Cl 

CO CO 


Z3 
CO 

O 
< 
m 

i 

»- 
o: 

Z> 
CL 

I s- 




! 






s l ! 


? 5 

H CO 


CO CO 
CO CO 

CO 


CO 
CO 
CD 


o 
m 


1366| 


1032 


2253 


r-~ 


I 1X5 cn w 

CO £ CO 

■ ? 5 ? 


1 8 £ 

2 co 


CO 
co 
CO 


i 


45 


Terminal 
(nl) 


co cn 
-cr co 

CN CN 

cn cd 

CN fN 


2926704 
2926707 


2927651 
2927551 


2928302 


2929256 


— ' 

2931336 


2932371 


CD 
CN 
CD 
•*T 
CO 
O) 
CN 


CN 
LO 

co 

OJ 

cn 

CD 

CN 


2939767 

2940452 
9940447 


2941472 
2942609 

2943012 


2945639 




50 


Initial 
(nt) 


2924191 
2925147 


t- CO 

m to 
m r- 

CN l CN 
CD ! CD 
CN i CN 


2928233 
2928318 


2929237 


2929756 


2929951 | 


2931340 


2932577 


cC 
cr 
r* 
c 
<*■ 
a 

CN 


2938403 
2939907 


2942500 
2943007 

2944205 


CC 
CN 
LT 

tr 
~? 
a 

o 


t 

) 

> j 

) 
4 




cn 2 ^ 


6515 
6516 


6517 
6518 


6519 
6520 


6521 


1 

j 5522 


6523 


I 1 

'5524 


6525 


CC 

r 
if 

(C 


5527 
6528 


•> O r- CN 

*j <o co co 
) m ift, in 

5 to CO CO 


r 
r 
cr 

cr 


> 
> 


55 


SEQ 

NO 
(DNA) 


3015 
2016 


3017 
3018 


3019 
3020 


3021 


3022 


3023 


3024 


3025 


tc 
r 
c 
r 


3027 
3028 

-moo 


r> o • *- ■ cn 

^ CO J CO j CO 

-> o o o 
i co co co 


3033 



BNSDOCID: <EP 1 108790A2J_> 



196 



EP1 108 790 A2 



5 



10 



15 



20 



25 



35 



40 



45 



50 



55 



Function 


| insertion element (IS3 related} 


insertion element (IS3 related) 


| two-component system sensor 
histidme kinase 


transcriptional regulator 




adenylosuccinate synthetase 


hypothetical protein j 




hypothetical membrane protein 


fructose-bisphosphate aldolase 


hypothetical protein 


methyltransferase 


orotate phosphoribosyltransferase 


hypothetical protetn 


3-mercaptopyruvate 
sulfurtransferase 








atched 

ength 

(a.a) 


o 
o> 

CN 


cn 
o 


o 

CO 


CO 

CN 




rsi 

N~ 


o 

CN 




cn 
m 

CO 


T 

CO 


rr 
o 

CO 


CN 
CO 


r- 


O 

m 

CN 


o 

CN 














































>> 






































imilarii 
(%) 


90.9 


84.3 


51.3 


65.6 




95.3 


59.3 




100.0 


100.0 


100.0 


91.2 


65.5 


60.0 


56.1 








CO 






































Identity 
(%) 


77.6 


67.4 


CN 


31.7 




r- 
cn 
co 


CO 
rr 
co 




100.0 


99.7 


100.0 


76.9 


39.1 


27.6 


29.6 








(u 

c 
Oj 

CO 
cn 
n 
o 
cn 
o 
o 

E 

o 
I 


Corynebacterium glutamicum 
orf2 


Corynebacterium glutamicum 
or11 


Streptorryces thermoviolaceus 
opc-520 chiS 


Bacillus brevis ALK36 degU 




Corynebacterium 
ammoniagenes purA 


Mycobacteriun: tuberculosis 
H37Rv RV03-58 




Corynebacterium glutam cum 
AS019 ATCC 13059 ORF3 : 


Corynebacterium glutamicum 
AS019 ATCC 13059 fda 


Corynebacterium glutamicum 
AS019 ATCC 13059 0R C 1 


Mycobacterium tuberculosis 
H37Rv Rv0380c 


Pyrococcus abyssi pyrE 


Mycobacterium tuberculosis 
H37Rv Rv0383c 


Homo sapiens mpsT 








db Match 


S60990 


S60389 


OC 
CD 

C 
CD 

< 


ex 

cn 
u 

<r 

CD 

1 

Zj 
O 

LU 

o 




i 

o 

CD 

CO 
O 
O 

m 
< 


G70575 




YFDA_CORGL 


S092B3 


«' 

O 

LL 

o 

o 


G70833 


AF058713_1 


CO 
CO 

o 
r- 
m 


z 
< 

Z) 

I 

I 

h- 

x 










CX 


o. 


CL 

Cn 


c: 

on 




CL 
cn 






CL 

in 


CX 


CL 
cn 


CL 


Li 

CD 


CX 


CL 

(/> 








Lt_ • 

rr °- 

oe 


tT 

cn 

CO 


CD 
CN 


1140 


CO 
CD 


u ■> 

(N 
CN 


1290 


cn 
in 


CO 
CN 


1167 


1032 


UTi 

cn 


oO 
iO 


CN 

m 
m 


CN 

r- 
cn 


CN 
CO 
CO 


CO 
CN 

r— 


cn 
CN 


cn 
cn 
ro 


Terminal 
(nt) 


2946698 


2947620 


cn 

O 
CO 
T 

cn 
cn 


2949265 j 


CO 

O 
ir> 
cn 
r\i 


2950434 


2S52691 


2952972 | 


2952975 


CN 

m 
cn 

CN 


2955523 j 


2956830 


2957485 


2958139 


2959520 


CD 

CD 
TT 

o 

CD 
O 
CN 


2962730 


2963198 


Initial 
(nt) 


2947591 


29476B6 


! « 

CO 

CT> 
1 T 

• cn 

CN 


CN 
OO 
CO 

cn 
^ 
cn 

CN 


1 2950207 


2951723 


2951933 


2952709 


m 

S 


2955272 


2956473 


*T 

r-. 
in 
cn 

CN 


, 2958036 


2959110 


! 2960371 


2961137 


2963008 


29635G6J 


So : 

</5 2 -2. 


co 
1 in 


6535 


6536 


6537 


6538 


6539 


o 

XT 
O 
CO 


6541 


cn 

CO 


5 

m 
o 


m 

CO 


6545 


CD 
-c 

CO 

CO 


LO 
CO 


6548 


6549 


6550 


cn 
m 
co 


got 


3034 


3035 


3036 


3037 | 


3038 


3039 


o 

T 

o 
co 


■c 
o 

CO 


o 

CO 


3043 


3044 


3045 


3046 


f— 
■c 
o 

CO 


CO 

<cr 
O 
co 


3049 


o 

8 

CO 


m 
o 



BMSDOCID: <EP 1 108790A2_I > 



197 



EP 1 108 790 A2 































5 
10 




Function 


virulence factor 


virulence tactor 


virulence factor 


3 
D 

J c 

=L o 
£ cL 

^ CP 
CU C 
CO J2 

E - 
2 3 

o. I E 
EE | 

o t- ft) 
cn CL o 


cation efflux system protein 
(zinc/cadmium) 


j 

n 
3 

3 

5 ST 

cn 

o 2 

O Q> 

E o 


alkanal monooxygenase alpha chain 


cystathionine gamma-iyase 
bacterial regulatory protein, lad 
family 


rifampin ADP-ribosyl transferase 


rifampin ADP-ribosyl transferase 


hypothetical protein 


hypothetical protein 


oxidoreductase 


15 




Matched 
length 
(aa) 




o 

O 1 
CM 


CM 
CO 


cn co 
CO o 


CO 
! CO 
CM 


CD 


cn 
o> 

CO 


LO 

r-- co 
co *- 


CO 


CD 

m 


CD 
CO 


O 
CM 


CD 
CO 
CO 


20 




Similarity 
(%) 


82.0 


55.0 


O 
CO 
CD 


CO CO 

in 


63.3 


%r 
to 


47.4 


62.4 
67.9 


65.2 


87.5 


562 


64.7 


CD 

d 

CD 






Identity 

(%) 


76.0 


380 


620 


24.7 
37 0 


23.7 


22.5 


21.1 ' 


36.5 
402 


cn 

T 


73.2 


30.5 


33.8 


31.9 


25 
30 
35 
40 


Table 1 (continued) 


Homologous gene 


Pseudomonas aeruginosa 
ORF24222 


Pseudomonas aeruginosa 
ORF23228 


Pseudomonas aeruginosa 
ORF25110 


Synechocystis sp. PCC6303 
slr0525 


Pyrococcus abyssi Orsay 
PAB0462 


Rhodococcus rhodochrous 
IF03338 


Kryptophanaron alfredi symbiont ! 
luxA 


Escherichia coli K12metB 
Streptomyces coelicolor A3(2) 
SC1A2.11 


Streptomyces coelicolor A3(2) 
SCE20.34carr 


Streptomyces coelicolor A3(2) 
SCE20.34carr 


Mycobacterium tuberculosis 
H37Rv Rv0837c 


Mycobacterium tuberculosis 
H37RV Rv0836c 


Mycobacterium tuberculosis 
H37RvRv0385 




db Match 


GSP Y29188 


GSP Y29182 


GSP:Y29193 


CO 
1 CO 
<D 
CD 
r^- 

CO 

'5_ 


s 

W. cn 

^' ? 
Q in 
< £ 
o X 
o. 

co O- 


cn 

CO 

o 

o 
m 

< 

cn 


CO 

< 

> 

or 

i ^ 

; 1 

ZD 
_J 

CL 
lO 


O _ 
O I- 

LLI | 

1 CM 
CD < 
1— 

LU O 
S CO 
CL O- 

m cn 


CO 

o 1 

CM 
LU 

o 

CO 

a 


CO 

1 

o 

CM 
LU 

O 
CO 

CL 

cn 


pir:E70812 


pir:D70812 


pir D70834 








i — 


Ol 

to 
r- 


__i 

ID 

cn 

CO 


1347 


r— co 
co in 

CO CO 


O 


1041 


1146 

567 


! s 

1 ^ 


CO 

co 


1125 


oi 

CO 


1179 


45 




Terminal ■ 
(nt) I 


2964434 


2965837 


2965533 


2966458 


2968789 
2959808 


2971003 


2972057 

007171A 


2972060 
2973230 


29742C0 


2974382 


2975591 


! 2976360 


2977774 


50 




Initial 
(nt) 


2964258 


2965076 


296518B 


2957804 


2968403 
2958951 


2969834 


C 

«- C 

o c 

r- r 
cn c 

CM C 


Z» f /L\JW 

2973205 
2973796 


2973961 


2974200 


2974467 


2975629 


2976596 






So' 

CO 7 * 


— CM 

■o in 

s. £ 


6553 


6554 


6555 


6556 
6557 


6558 


6559 


6561 
6562 


6563 


6564 


1 CD 

i m 
1 <° 


6565 


■v. 

cn 
in 

CD 


55 




CO 2! £ 


I (UNA) 

1 3052 


3053 


I in 

I 8 


3055 


3056 j 

3057 I 


' 3058 


1 cn < 
in . c 
O 1 c 
ro c 


3061 
3062 


3063 


! s 

o 

CO 


1 uo 

! § 

1 " 


3066 


j 3067 



198 

BNSDCDCID: <EP 1108790A2J_> 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



c 
3 



C= CO 

CD — ■ 



Q) ^ 



■o 



c 
o 
(J 



E 

n 
X 



TO 

D </> 

_L CO 

<?° 

E ~ 

co £ 

■e o 

2 co 



E.g 

— Q. 
0) o 

CO O 
3 

E ~ 



on 

0) 

o 

>>co 
E q 

o 

OL< 
CD T 

r: O 
CO CO 



* E 

> w 



J3 

E 



CD 
CO 



cr 
O 



< 



o 

CO 



< 
cr 

DC 
< 
O 
CD 
< 
CL 

o 



c 



CO 

LU I 

C 

i. in m 
o cc c 

o g- 2 

< Q. 
» o TO 

c ^-5 



5 c U 
DO 

a> o 
o E 

0J d) ^ 
C Q. O 



a; 
c 



a> ro 



s > 

o cr 

2 X 



> 



^ CO 



2q 

CO 



2 ro. 



CO 

to 



t - 



tD 



10 to 



lo 1 

. CO , 



— OJ ; ro ; 
o .0 O ' 



O 

a 
cr 
i- 
co 

I 

LU 

0l 

cr 

o 







ro 




CM 




■3 








E 








> 




CO 








E 












T> 




CO 




jO 




> 




2 


CO 






m 





o 



c 

CO 



to 

rvi 



>* 

E 

o q 
o. to 

is O 
co to 



CD 

cr 

cl 



CD 

ro 
co 



10 
co 

CO 



O 

CO 



CO 

to 



CO 
OJ 



a; 

c -o £ 

m ro O 

s 2 ^ 
£ IS o 

I 8 g 

COO) 
.'DO 

m c co 



E 

CO 



o 

a. 
X 



a. 



co 

LL 



CO 
CO 
CO 



o 
a. 



o 
co 



E 
o 

CL 



O 

CL 

X 

o 

CO 



3 



co J 
00 1 



o , 
CO j 



o 

JZ 

o 



lus 








Q_ 




0 




E 




0 








0 






■O 


CD 


CO 


O 


TT 


CO 


CO 


CO 


CO 




CM 


0 


^ 


CO 


CO 


CO O 



CO 

CD 

04 

X 
Q 
< 



CO 



, I r- 



tO 
a> 

CO 



s - 
°1 8 



BNSDOCID: <EP 1108790A2J 



199 



EP 1 108 790 A2 



10 



15 



20 



ZD 



ro n> ' 



25 



30 



35 



40 



45 



50 



-2 ? 
£~ 



"O 
CD 



0J 
,03 



o 

£ 

o 

X 



c 

(U 

o 



o 

Q. 



11 



E 
o3 



o 
c 

0) 

ro CD 
O ^ 



o o 
o 

£ - . 

O CO 



"6 



n 

o 

CM 



r- cn 
o 

CM 



CD 



o 



o » 



55 



O n < 
UJ s 

to ^ 9. 



o 

CO 



O 

u 

ul 



CO 
> 



=J > 



XI tO 

a. >n 
t/i — 
o 

CC (0 



o 

O 
UJ 

a 

CO 

a 



CD 



o 
o 

CN 
O 

o 
o 



LO I 

to ; 



CO 



O [ O 



cr> o 

co cn 

o I o 

co ; cn 



3 
CO 

< 

CD 



X 

o 



a> 




CO 




</) 




> 








<r> 




o 








a> 




u 




>> 




£ 




o 






ro 


to 


_£= 


o 


o 


o 


o 


CN 


00 


— t 


CO 


Lu 



cn ' 03 



CO 

< 

LU 

> 

I 

O 
cr 
a 
< 

CL 



CN 

CD 
O 



CD I <D 



cn 

CO 

o 
o 

CO 



o 
o 
co 



E 

o CO 

"HI cd 

0) LU 
±= O 
CO CO 



q 




KJ 








CO 




Q 




ra 








3 




CL 




t/> 




ra <C 


c 


o 


o 


£ 


£ 


to 


o 


O 


■o 


CO 


ZJ 

a> 


CN 


</j 


CO 


a, co 



o 

CO 



< 

O 

< 

a. 

CL 



CC 

cn 
o 



CD 
CD 
O 



cn 
o 
o 
co 



CN - 



cn 
o 



to 



r- I CN 

O ! O 

CO ' CO 

tO CO 



CD I O 

cn | o 

9! i r: 



CN 




BNSDOCID: <EP 1 108790A2J_> 



200 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ro <d -Si 



J5 ^ 
1 - 



O ^ ^ 

, H O ro I O 

£2: a«S 



o 0 < ^ 

to 2 9. ro 



< O 



LLi 
O 



CO 
O 



o 
o 



o 

CD 
CO 



CD 



E 

O. 



5 c 

re 

O D 

3 cu 



CD 
CD 



E 

CM 

5 



CM I- 
GO 

CO 



O 

o 

LU 

I 

UJ 
Q_ 
<f 
Q 



■r- ; tD 
CO 

o I o ' 



CO C>1 
O 
CD CD 
CD CO 



CD 



21 

roj co J ro 



I CD 

i ^ 



c 
o 



ro 



O .E 
— T> 

£ g 



E 

CM 
5 



O 

o 



a. | 



5 

CL 

rr 
o 



CD «> 
E o 

E «C 



CO 

o 



en j ro 



co j co 



CD 

o 



CL ^ 

o 

~ o 
o o 

« t: 

39 

W o 

o o 

O O 

o ^ 

fa 2 
-J CL 



CO 

ro 
o 



o 
ro 



u 



a 

i 

x 

3 



rsi : 

CD . 

CD : 



<~ , CM 



- J 
CO I 



CJ o 
>» GO 

E 9 
o o 

Q. ™ 

a a 

to CO 



o 

CL 
cn 



O 
O 
LU 

^ < 

O i °- 



cm 
co 



CN 

CM 
O 
CO I CO j 



\r ! in j 
rvi . r%- | 



co j t ; tn 

f>J | OJ ; f\l 



BNSOOCID: <EP _11C8790A2 I > 



201 



EP 1 108 790 A2 



5 
10 


Function 




oxidoreductase 






transcription antiterminator or jeta- 
glucoside positive regulatory protein 




6-phospho-beta-glucosidase 


i 


6-phospho-beta-glucosidase 


aspartate aminotransferase } 




transposase (ISCg2) 


hypothetical membrane protein 




UDP-glucose dehydrogenase 


deoxycytidtne triphosphate 
deaminase 




hypothetical protein 




CJ 
t/1 
ca 
"D 

'c 

E 

CO 

cn 

O 

o 

CO 

>> 
CJ 

o 

< 

ro 
A 


15 


Matched 
length 
(aa) 




o 

CM 




CN 
CD 




co 




CD 
CD 


CM 

o 

T 




o 


CO 
CO 
CO 




CN 


CO 
CO 




CO 
CM 
CN 




O 

ra- 


20 


Similarity 
(%) 




63.8 




69.3 




59.9 




78.8 


80.9 




100.0 


70.2 




72.2 


72.3 




59.4 




se. 1 




Identity 
(%) 




34.8 




28.1 




r»~ 
co 
T 




43.9 


53.7 


100.0 


33.6 




40.5 


43.6 




30.6 




28.5 


25 

CD 

c 

"c: 
o 

30 ^ 

QJ 

35 
40 


Homologous gene 




Streptomyces coeiicolor A3(2) 
mmyQ 




Escherichia coli K12 bglC 




Clostridium longisporum B6405 
abgA 




Clostridium longisporum B6405 
i abgA 


Methylobacillus flagelfatus aat 




Corynebacterium glutamtcum 
ATCC 13032 tnp 


Streptonyces coelicolor A3(2) 
SCQ11.10C 




Sinorhizobium meltloti rkpK 


Escherichia coli K12 dcd 




Streptonyces coelicolor A3(2) 
SCC75A.16C 




Streptonyces thermoviolaceus 
nagA 


db Match 




gp:SC0276673_18 


I 


_j 
O 
o 

LU 

(D 

o 

CO 

CL 

</> 




O 

O 
_j 
o 

<' 

o 

m 

< 

CL 
cn 




o 

_J 

o 

-J 

o 

<■ 

o 

CD 

< 

CL 

</> 


qp;L78665 2 




r- 

CO 
CO 

LL 

< 
Cl 
cn 


o 
5 

o 

CO 

id 
cn 




prf:2422381B 


— ! 

o 

CJ 
LiJ 

1 

a 

o 

Q 

CL 

un 




CD 

<' 

LO 

O 

O 

CO 
Cu 
cn 




r- 
co 
o 
o 
03 
< 

CL 
CO 




Ljl — 

g£ 


CO 

O 
CD 


*r 

CM 
CD 


CD 

in 


CO 

t-n 


CO 
CN 


O 
CD 
CO 


co 
ro 


O 
CM 


1257 


ooc 


1203 


1257 


CO 

CO 


1317 


r-- 
CD 
uO 


CO 
CN 


h-. 


1689 


LO 
CO 


45 


Terminal 
(nl) 


3028163 


3028891 


3029033 


CO 

CO 

cu 

CN 
O 
CO 


CN 

co 
r- 
cn 

CM 

o 
CO 


3029702 


3030535 


3030101 


3031979 


I 3032348 


3033863 


3035437 


I 3034105 


o 

LO 
CO 

o 

CO 


| 3036845 


cn 

co 
c: 
xr 


3038942 


3038993 


CO 

r— 

! O 

; n 
o 
co 


50 


Initial 1 
(nt) j 


tr. 

r- 

c 
c*~ 


3028268 


GO 

r-» 

00 

cc 

c\ 
c 


3029474 


^1 
c 
LT 

CT 
C- 

<z 
1 r 


1 

3030061 


3030155 


3030340 


ro ! 
r\i ■ 
r- . cc 
C C- 
co r- 

o c 

ro , <r 


3032651 


3034181 


r^- 

OO 
CN 

r- 
CJ 
r* 


cc 

LT 

r- 
cc 
c* 
C 
c 


3037411 


LT 
CC 

r- 
c 
c 
c 


3038172 


3040681 


CM 
CO 

O 

CO 




So - 

CO ^ « 


- cc 

I s 


6527 


i * 

i tt* 


6529 


6530 
6631 


r- 

CC 


6633 


<3 

c*- 

ic 

CC 


6635 
6636 


6637 


QC 

c 
cc 
cc 


a 

> cc 


) o 
i rr 

J CD 

> co 


T- | OJ 

rr ! v 
CD ; CD 
| CO | CD 


CO 

cc 

cc 


*cr 

T 
CD 

i ,0 


55 


uj O ; 

LO ^ C 


1 1 


3127 


a 

• o 

r 


) m 
4 CN 

> CO 


3130 
2131 


i? 
I<- 


3133 j 


■<a 
c 

c 


r «-f 

) c 

| ? 


3136 


3137 


3138 


•> o 

- ; ? 

■) I ro 

i 


OJ 

j co j co 


r* 


i ^ 
i S 



202 

BNSDCJCID: <EP 1 108790A2_L> 



EP 1 108 790 A2 



5 
10 




Function 


t 




hypothetical protein 






hypothetical membrane protein 


acyltransferase or macroiide 3-0- 
acyltransferase 




hypothetical membrane protein 


i 
t 


hexosyltransferase 


methyl t r ansferase 


phosphoenolpyruvate carboxykinase 
(GTP) 


C4-dicarboxylate transporter 


hypothetical protein 


hypothetical protein 


mebrane transport protein 




15 




Matched 
length 

(a.a) 






1416 






CO 

to 
ro 


CO 

o 




O) 
CM 

tn 




o> 

CO 
CO 


in 

CM 


o 

CO 


CM 

ro 
ro 


CM 


O 

CM 


CO 

to 




20 




Similarity 
(%) 






49.4 








51.0 




54.8 




79.1 


73.3 


78.5 


52.7 j 


CM 

to 


85.0 


72.3 








Identity 
(%) 






29.6 






CO 
CM 


r*- 
CM 




31.2 




53.4 


58.6 


m 


24.4 


35.7 


69.1 


42.3 




25 










































30 
35 


Table 1 (continued) 


Homologous gene 






Mycobacterium leprae 
MLCB1883.13c 






Mycobacterium leprae 
MLCB1883.05C 


Streptomyces sp acyA 




Mycobacterium leprae 
MLCB1883.04C 




Mycobacterium tuberculosis 
H37Rv Rv0225 


Mycobacterium tuberculosis 
H37Rv Rv0224c 


Neocallimastix frontalis pepck 


Pyrococcus abyssi Orsay 
PAB2393 


Escherichia coli K12 yggH 


Mycobacterium tuberculosis 
H37Rv Rv0207c 


Mycobacterium tuberculosis 
H37Rv Rv0206c mrnpL3 




40 




db Match 






1 

ro 
oo 

00 

m 

o 
_i 

a 1 






I 

CO 
CO 
00 

m 
a 

_! 
CL 

cn 


o 
o 

T 

o 

L_ 

d_ 




gp:MLCB1883_3 




CO 

cn 
o 
r-^ 
CD 

ci 


pir F70961 


or 

LL 
O 
UJ 

*' 

O 
CL 

a 

CL 
i/> 


tn 

CM 

tn 

UJ 
o. 


_ j 

o 
o 

UJ 

x 1 

e> 
o 

>- 

Ol 

LO 


pir:E70959 


pir.C70839 








ORF 
(bp) 


T 


o 

CM 


3129 


CM 

to 


m 

CT> 


O 

cn 


1068 


CO 

o 


CM 
IM 

•^r 


cn 

CO 
CO 


1137 


r<- 


1830 


1011 


lO 

to 

r-- 


to 
o 
r*- 


2316 


1422 | 


45 




I 

i Terminal 

(nt) 


r-- 
co 
v 
cm 

T 

O 

ro 


CO 

o 

CM 
TT 
CD 
CO 


CO 
CO 

in 

TT 

o 
ro 


si 
?! 

o . 
ro ■ 


j 3045990 


CO 

o 
co 

c 


3046122 


O) 

r*- 
o 

CO 


cn 
r- 
rr 
cn 

M 
O 
CO 


I 3051190 


CO 

in 
cn 

O 
CO 


3051964 


3052062 


3055769 



3056531 


3057317 


3059643 


3058095 


50 




Initial 
(nt) 


TX 

o 
o 

•^r 
O 
ro 


CO 

o 
m 

CM 

O 
CO 


O 

to 
to 

CM 
VJ 

O 

CO 


_ i 

1 i 

-O- 5 

to j 
ro 

8| 


to ' 

m , 

•v * 
o : 
ro • 


to 
O 

CO 


3047189 


o 
cn 

■v 
O 
ro 


CO 

in 
o 
co 

M 

o 
ro 


3050522 


3050592 j 


NT 

cn 

to 
o 

CO 


3053891 j 


3054759 


3055867 


3056613 


3057328 


3059517) 




! 


(/> ^ 3 


lO 

to 
to 


to 

to 
to 


r- 

to 
tn 


co i 

; 

to 
to 


cn I 

tn 
to 


O I 
in 

cr. { 
cc ■ 


u ) 
CO 

co 


5652 


6553 


un 
CO 

co 


in 

LT. 
CC 
CC 


6556 


6657 


6658 


6659 


0999 


6661 


6662J 


55 




Sof 

00 ^ Q 


si 

"I 


tn 

V 
CO 


r>- 
n 

ro 


CO 
rO 


rr> 

T 
CO 


o 

tO 
lO 


3151 


CM 
LO 

CO 


3153 


■^r 
m 

to 


! 3155 


to 
in 

T— 

CO 


3157 


3158 


3159 


3160 j 


3161 


3162 j 











































203 

BNSCXDCID: <EP 1 108790A2 J_> 



EP 1 108 790 A2 



5 
10 




Function 


hypothetical membrane protein 


hypothetical membrane proteir 


< 
u 
5_ 

5 
j 

D 
/) 
TJ 

ro 

O -CZ 

1 ^ CD 

IE 2 

2 " o 

O.CD CL 


acyl-CoA synmase 
hypothetical protein 


major secreted protein PS1 protein 
precursor 


antigen 85-C 


hypothetical membrane protein 


nodulation protein 


hypothetical protein 


hypothetical protein 


phosphatidic acid phosphatase 


15 




Matched 
length 
(aa) 


CO 
CO 


oo 
o 


523 
1747 


CM CD 

cn *- 

LO CO 


LO 

CD 


ro 
ro 


CD 
CO 


LO 

cn 
rsi 


CO 

to 

*" L 


CO 

m 
to 


o 


20 




Similarity 
{%) 


o> 
rvi 
CO 


69.4 


CD CM 


62.3 
67.4 


99.5 


62.5 


61.2 


lO 


75.0 


74.7 


56.5 






Identity 
(%) 


29.1 


CO 
T 

ro 


49.7 
30.2 


33.5 
39.8 


98.6 


36.3 


37.5 


27.1 


51.2 


55.6 


28.2 


25 
30 
35 
40 


Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37Rv Rv0204c 


Mycobacterium tuberculosis 
H37Rv Rv0401 


Streptomyces coelicolor A3(2) 
pccB 

Streptomyces erythraeus eryA I 


Mycobacterium bovis BCG 

Mycobacterium tuberculosis 
H37Rv Rv3802c 


Corynebacterium glutamicum 
(Brevibacterium flavurn) ATCC 
17965 cop1 


Mycobacterium tuberculosis 
ERDMANN RV0129CfbDC 


Mycobacterium tuberculosis 
H37Rv Rv3805c 


Azorhizobium caulinodans 
ORS571 noeC 


Mycobacterium tuberculosis 
H37Rv Rv3807c 


Mycobacterium tuberculosis 
H37Rv Rv3808c 


Bacillus licheniformis ATCC 
9945A bcrC 




db Match 


pir:A70839 


pir H70633 


<r- LU 
1 O 
LO < 

CO 00 
CO 

u_ or 

< LL) 

CL Q 
o> */> 


< 

lO 

ro co 

O CO 

T- O 

ro 

CM UL 

o.. ] 'el 


_j ■ 

O 

ct 

o 

o 

I 

0- 

8 ! 

CL 


ZD 
I— 
O 
> 

o 

LO 
CO 

< 

CL 


pir:A70888 


< 
o 
O 

o 

LU 

o 

Ol 
%n 


CO 
CO 

co 
o 
r- 

CL. 


pir.D70888 


sp:BCRC_BACLI 






\n 


1083 


CO 
CO 

CO 


CO o 
TT CO 

in cc 
t— v 


1788 
927 


498 
1971 
1401 


219 
1023 


2058 


CO 

cn 
cn 


o 

LO 


1968 


1494 

A 77 

4 / f 


45 




Terminal 
(nt) 


3060733 


3061095 


o «- 

CO UT 
CO CJ 
C> 

to cr 
o c 
co c 


3068143 
3070214 


5 s S 

o o c 
CO co c 


3073857 
3075540 


3076715 


3078853 


3079848 


3080344 


3083960 
3083935 


50 




Initial 
(nt) 


3059651 


3060733 


3052927 

OOC77QO 


3069930 
3071140 


3071644 
3073620 j 


3074075 
3076562 


3078772 


3079848 


3080351 


3082311 


3082467 
3084411 






So' 

CO ^ 


— ro 

A3 CO 

s £ 


6664 


6665 


-> r- oo 
3 CO co 
D CO CD 

d ! co co 


6669 
6670 


6672 
6673 


I ^ 

r- 
to 

CO 


6675 


6576 


6677 


6678 

6579 


55 




° O * 
co ^ 


Il s 

9. j fo 


I ID 

! s 


lO C 
CD C 
t 

CO c 


0 ! r- co 
D CD CD 

o j co ro 


3169 
3170 


3172 
3173 


3174 


3175 


I 

I 3176 


3177 


3178 
3179 



BNSDOCID: <EP 1108790A2_L> 



204 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



3 ?S 



E 

in 



o 
o 
E 
o 
T 



O 



z 




o 




Ol 




TO 




C 




QJ 




cn 




>> 




X 




o 




o 




c 




o 




E 




o 




_c 


cn 






an 


J 


>>. 


O 






"53 


0) 


c 


TC 




X 


•5 


O 



00 

« ro 



Sg f 

cn ^ S 



CO 



CD 
CL 

! 

o 



U • CD 



CO j 
CO 

to 



8ssf|g| = i 

j w - S j m , co ; 



o 

Q. 

jC 



CD 



CO 
CO 



CO 

VJ 

o 

CT3 
ZJ 

CD 

ro rs* 
c: cn 

O CO 

E « 
o 

tn h- 
CL < 



LU 
< 
LU 
60 

a 

1 

CL 
) 

(3 

CL 



o 



E £ 

.2 co 

-Q > 

O £T 

o r- 

CO 

S X 



CN 

uo 
o 



CD 

— o 
E » 

S > 

X) > 

o cc 



a 



< 



E 



d c 

o> o 

O CL 

*— tn 

— ; <U 

CO »- 

C _1 

CL CD 



00 ^ 

1 o 



-O > 

o a: 



o 

CL 



CO 

rvi 

CD 



o 



< 



CO 
CD 
CN 



O 
CI 

o 



o 
CO 
h» 

o 

CD 
O 

ro 



CM 
CO 

o 



CO I 
CO 1 

o . 

CO I 



CO 
CO 

£ 1 



CO 

o 



cc : 
CO . 
CC i 



co 

CO 



co 
co 
co 



co 

CO 

to 



^ ! 

CO 
en 1 



tn 

co 



CO 

ro 



n j ro 



CO 

o 

CO 



Co 
O 



Is 

o tr 
5? 



o 
cl 



o 

CL 



o 
to 



F <o 

Is 
5 > 

x> > 
o a: 

CO 

2 X 



ZD 

c E 

CJ o 
(N C 



CO 
U0 
CD 



O 
cn 
to 



o 
cn 



cn 
co 

CO 



E 
< 



3 
< 



CD 
CD 



CD 
CO 



co ; o> 
00 cn 
o o 



1 I o> 

; J co 



BNSOOCID: <EP 1 108790A2_I_> 



205 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




BNSDOCID: <EP 1 108790A2_I_> 



206 



EP 1 108 790 A2 



5 
10 




Function 


L-lactate dehydrogenase or FMN- 
dependent dehydrogenase 




immunity repressor prolein 


1 
1 




phosphatase or reverse 
transcriptase (RNA-dependent) 




peptidase or lAA-amino acid 
hydrolase 




peptide methionine sulfoxide 
reductase 


superoxide di.smutase (Fe/Mn) 


transcriptional regulator 




multidrug resistance transporter 








hypothetical protein 


membrane transport protein 


transcriptional regulator 


two-component system response 
regulator 


15 




Matched 
length 

(a.a) 


to 
r-- 
ro 




m 






CO 

to 

CO 




CM 
CM 


j 


012 


to 


OJ 
CO 
OJ 


oo 

CO 




! 




to 

CM 


r- 


CO 


CM 
CM 
















































20 




Similari 
(%) 


68.9 




80.0 






51.3 




CO 

to 




CO 

to 


r- 
OJ 

CO 


u 

LO 

to 


o 
co 








OC 

XT 
CO 


59.3 


65.0 


75.5 






Identity 
(%) 


40.4 




45.5 






cr> 

an 

CM 




CD 

to 

CO 




to 

r- 


82.3 


32.5 


23.4 








33.8 


27.3 


37.2 


50.9 


25 
30 
35 


Table 1 (continued) 


Homologous gene 


Neisseria meningitidis lldA 




Bacillus phage phi-105 ORF1 






Caenorhabditis elegans 
Y51B11A.1 




Arabidcpsis thaliana ill! 




Escherichia coli B msrA 


Corynebacterium 
pseudodiphtheriticum sod 


Bacillus subtilis g'tC 


Corynebacterium glutarntcum 
tetA 








Mycobacterium tuberculosis 
H37Rv Rv3850 


Streptomyces cyanogenus lanj | 


Bacillus subtilis 168 yxaD 


Corynebacterium diphtheriae 
chrA 


40 




db Match 


< 
to 
o 

CO 
CO 

Os» 
CM 

"r 

CL 




sp:RPC_BPPH1 




i 

i 


J 

CD 

LTJ 
> 

LU 

CJ 
CL 

cn 




X 

< 
or 

< 

i 

_j 

_j 

CL 
cn 


1 


—i 
O 

u 

LU 

1 

or 
in 

Cl 
Cl 
</> 


pir: 140858 


sp:GLTC_BACSU 


o 

*~l 
o 
o 
o 

OJ 
r- 

< ; 

CL j 
CO * 








pir.G70654 


prf 2508244AB 


id 

CO 

u 
< 

CO 

o' 

> 

ol 
(/) 


m 
o 

CO 
CO 

oo 

CO 

CN 

tz 

CL 






SI 


1215 


CO 

o 


CM 
CO 


CO 

ro 




1617 


to 

CO 


CM 
O 


o 
uo 


CO 

to 


o 
o 

to 


M~ 
CM 
CO 


1134 


1511 




OJ i 

2 I 


ro 

CO 

to 


CO 

-cr 


to 

cn 


to 

CO 

to 


45 




Terminal 
(nt) 


3120379 


3121313 


3121909 


3121992 


3123932 


3122555 


ro j 

TT 1 

CM 
CO ] 


3124897 


3125492 


m 

CO 

LO 
OJ 

ro 


3126991 


3127494 


3129739 


3131395 


3133030 


3131508 


r- 

ro 
ro 

CO 


3133778 


3135752 


3135856 


50 




c S j 
I 


3119065 


CD 
CO 

cn 
o 
oj 

ro 


3121593 


3122129 


| 3123222 


3124172 


CO j 

s 
5 

ro ! 


CO 
CO 

rj 

LO 
OJ 

CO 


3125343 


CO 

to 

OJ 
CO 


'3126392 


3128417 


3128606 


cn 

CO 

r*- 

CO 

OJ ? 

ro j 


3132920 


oo 

CN 

c 

CO 1 

ro 

CO 


3133115 


3135268| 


3135297 


3136491 








6717 


6718 


6719 


6720 


6721 


6722 


6723 


| cO 

j r - 
to to 


6726 
_ 


CM 
r- 

to 


CO 
OJ 

f-- 
to 


6729 


6730 


6731 


CN 
ro 

r*- 
LC 


6733 


6734 


6735 


6736 


55 


i 


SEQ 
NO 
(DMA) 


3217 


5| 

CM 


i 

CD ; 

cm ! 

"1 


o ! - i 

CM • CM 

cm j cu 

ro ro 


r-i 

C>l 
CM 

ro 


[3223 


CM . 

cj ; 
ro i 


3225 


3226 


3227 


3228 


3229 


I 3230 


3231 


; 3232 1 


| 3233 
I 


3234 


cO * 

-I 


to 

CO 
CM 
CO 



BNSDOCID: <EP _1108790A2 I > 



207 



EP 1 108 790 A2 



o 



10 



15 



20 



25 



30 



35 



40 



45 



50 



£i c ro 

CO <D — 



E 

CO 



to 



ID 



T3 
CD 



_CD 
-O 
,03 



c 
cn 



E 
o 
X 



c 

QJ 



C 



O 



Q- 
XL 



a 
a. 



IS 

< 



c 

Cl 

7a 
u 

% 
o 



CM 



[CD 



lit a. 



>^ to 

o £ 
O u 



aj 

a o 

£ ™. 
o a> 

09 CO 



£ H 
aj 



< 
o 



CN 



*' 

to 
X 
O 
to 

Ql 



CO 
X 
O 

CO 



3 ^ 



5 X 



Is 
\ s > 

! OK 

; o 



c '■ to 
c ~ , co 



to 2 «. 



55 



co 2 a 



IT) 

tr> 
to 

T — 

CD 

CN 
5 



I X 

E 



O 
O 

5 ; 
< 



a 



o 

o 

) 

1 

X 

> 



11 

c b 

O (0 

^§ 

2 £ 

3 o 

CD t_) 

2! a> 



goo « 



(t> § u 



c 

0) 

o 
a. 



id 
o 



ro 

1 r-> 



I a> 
! 2 



CD 

CN 



££ 

ro J5 o 

I -CO 

; o o h- 



X -r 
O 1 
I 



CN 
O 
ro , 



5 I 



CL 



to 

CD 



U 

u 



o 

o 

UJ 

t 

O 
o 

— I 



tu 
u t> 
2^ «~ 

E «? 

o CO 

uo 

a) 

±= O 
CO co 



CN 



CO 

o 
o 

</) 

O- 

en 



_ > 

JD > 

o a: 

>^ cr> 
2 X 



O 
> 

I 



I CD i 



oo 
CN 
CO 



BNSDOCID: <EP 1108790A2_L> 



208 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 




209 

BNSDOClD:<EP _1 108790A2_I_> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



too;- — 



CO 



,03 



CD 
O 
O 

£ 
o 



W w C 

?f 2 
<u ^ 

1/1 d) C 
C W O 

V) ^ 3 
»— O- ni 

^ c! 

CO — 

d r (D 

C CL C 

§.«£ 
E 1 = 

o ro T\ 



<D 



u ^ 



CO c 



Cl 

o. 

O O 
0) in 
cn ^ 

O u 

o <U 

TO 



O O 
a. o. 



2 => 



0) 



CO 
CD 



O 
CU C 

11 
g F 

c a. 
o a 
£ < 

&5. 



5, S 
S 2 

CL < 

< Cl 
cn ^ 

CL.£ 

T* to 
o c 

im i: 



8^ 



O 
o 

UJ 

LLJ 
< 
CD 



SI 5 



E 



50 



55 



a 
< 

CO 

t 

CL 

O 
X 
CL 
Cl 



5 

(0 
LU 

cn 
a_ 

i 

< 
a. 

o 
u 

Cl 



CD 



< 
CO 

<' 

CL 



CO to 



1 



CO 2 « I 



LLi O 2 

co 2 a 



to 



ID 
CO 



U 

CO 

O 

I 

o 
o 

O- 

a) 

CO 
CJ) 



I 



to 



> 
CO 

I 

2 

rsi 



as 

i CL < 
It * 

| < CL 

CD 2r 

£ Cl 



I 8 



E "3 
rsi 



8 

CD 



00 



11' 

LU OJ 



O 

o 

LU 



< 

CL 



E 




E 






ZD 


u 




u 


mi 




E 


tn 












CD 




o> 


E 




E 






D 


is 




QJ 


1j 




Tj 


TO 
-O 


CO 


CX) _ 


QJ 






C 


CD 




>v 


CI 


£ o 


o 


in 


o c: 


O 


h- 





I o ' 



to — 



* i 

LL I 

o ; 



CO 



CD i CD CD 



CO 

I cm I 



I to! 





CD ! 


o 


CD i 


CO 


S 1 


r»- 




r- 








n 


-5-1 


o 




CD 




r«- 




CD 




o 


! 5 


O) 




CM 




CO 


i » 



BNSDOCID: <EP 1 108790A2_I_> 



210 



EP1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



45 



-O 



CU _ 



to 



T3 



o 
cn 

jO 

o 
E 
o 
X 



o 

c 

e£ 

£X O 

T- to 
O C 
Q. aj 
</» 

c ~* 
ro a> 

a» O 

C Nl 
CO c 



£ 5T 
E § 



50 



55 



£ 


CD 




Cl 


o 


rz 


*E 




ro 


5 




< 


CT) 




E 






E 


CD 


as 


ac 






cr 


CD 




ro 






cn 


o 


OJ 


a 


CN 



< 

cL 



o 
to 



1 = 



2 . < 
ro : Q 

■£ • ro 
^ 1 a> 



CD r- 

O j to 



i ° 

!•■§ 

: w 
I ° 

in 



1 CO 



3 
CL 
Hi 

c/3 
a 

i 

< 
o 

CL 



O 



O 



w2 □ 



ro j ro 



cn a> 
cn cv 

CO CO 



CO 



CO 



CO I f-- 
cr> i cn 

CN , CN 
n I ro 



8i 



<r- lO 
CO CO 
CO CD 



a) 

ro m 

o rsj 

o Lj, 
o 

^ E 



o 

CL 



O 
CL 



° E : °- 



CD 



IT) 
00 



» > 

ro ^ 
o cr 

s5 



O 
> 



o 

X 

o 
> 



r- 

CD 



E£ 



53 > 
ro _ 

o cr 

o r- 
:>*co 
2 X 



c: t 



CL ! 



t: 
o 



CO 


n 




CO 


CN 





E 2? 

-5 ro 

ro 

i3 > 

o cr 

o 

>, CO 

2 X 



> 



LL 

O 
> 





OJ 


O : 




CN 


CO 


cn 


o 


00 


CO 






ro ; 


CO 


CO 




O 


O 


CO 


ac 



cn 
o 

CO 



o 

ro j 



o 



OJ — 

xj i O 



CD CD I O 



co | < 



I co I co 

j CO | CO 



BNSDOCID: <EP 1 1C8790A2_I_> 



211 



EP 1 108 790 A2 



w 



15 



20 



30 



35 



40 



45 



50 



55 




BNSDOCID: <EP 1 108790A2_I_> 



212 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



3 



."5 



0) X. 



E - 

CO 



E 
o 
X 



45 



50 



55 




CO 
O 

c ~ 

E 5 

CO X) 

Q in 



O 
CJ 
LU 

I 

< 
< 



CM ; 
CO I 
CO j 



: co 



CM O 
CO CD 



m J co 

CO I CO 

CO CO 

CO CO 



o 
< 

m 
_} 
a 
< 

> I 

CO 



Q. 



rvj 



-cj cn ^- -5 



rc 
o 

O) 

cc 

-E J 
a> o 

o «? 
TO °- 

c o 

■2 

o P o 



CP - 

o B 

-e ^ 

5 <V 

CD CO 

i 5 H 

! cv a? 



CO TJ 
LO 

-o 

fa CD 



sip. 



*_ to 



E -5 £ 



-- 5 

'~ CO £T 



— «/> 
— w <p 

X o 



£.2 



5 ° u 

£ * 



* c 
co g 

CO -o 



o — 

CM C3 

- c 

7 CN 



C T3 



5 1 



>> CD 

£ 
o 

a. ft 

:= o 

CO CO 



o 
o 

CO 



cm i cm 



o • 



O 
o 

LU 

1 

LU 

a 
a 
I 



.£ o 

0) 

ro 

O -o 

Q_ « 

™ c 0) 
.2 o § 



o 
c 

a> _ 

°£ 

CL O 

O g 

Q. oo 

lO w_ 



— CO 

cu o 

c £ 

2 £ 

1 I 
c 



m 

CO 



CD 
CD 
CM 



s 

U 
(O 
X) 

U CD 
CL ^ 



r 

o 
$ 

LU 

I 

DC 
O 



co 



o 



ZD 
CL 
LU 
CO 
CL 

1 

< 
o 
a 



co 

CO 



I CD 



CD 
T 
CO 



BNSDOCID: <EP 1 1 08790 A2 I > 



213 



EP 1 108 790 A2 



c 

3 



10 



15 



20 



25 



30 



35 



40 



T3 
C 

8 



— - C CO 
rO <U - 

5 - 



o 
E 
o 

X 



<2 

a) 

5^ 

CL 

a> ia 
rr c 
o 

CL 

E -o 

>* a 

CO 

3 o 

CO c 

I'e 

2 S 
a. o> 



o 

CL 

E 



CL 





0J 




en 




cc 








In 




c 




a: 








">> 








bo 


TO 






o 


E 




ra 


CL 


i: 

*e 


CO 

o 



ra lo i? 

o -a jc 
-C c 

o co ^< 

O CL gpfX 
" O O Ol 

a) 55 cl S 

- £ u) C 

o -~ 0 c 



* O 

a. 

i s 



o 
o 



o 

it 



cr 
O 



c . 



50 



55 



5 u_ 



c ^ 
cTo 
o co 
u < 



m 



a. 



co 

X- 
Q- 
O 



"5 



E 
E 



o 

C7> 



CO 

CM 



CD Jj= 



o 
co 

CM 



JD 

CD Jj= 



a> o 



c^ 



CO 

o h- 

a < 



E 
o 



E 

3 



co 

w CL 

CD Jj= 



cr 

CQ 



CL 

cr 



2 §S s 

^2 ™ jo 



a n < 



00 

cc 



n 
cm 



CD 

I 

CD 
Cl 

rr_ 



CD 
cr 
o 
o 

! 

D 

CL 

cr 



T- . CM 

lO tO 

CD ! CD 

CO CD 



to 



S 

LU 

cr 

CD 

i 

o 

CL 

cr 



LO 
CO 



i • 

8_ S 

~ = 1 

c- a) ^ _ 

>> ? o 

w o a a 

52 5 8 I 

h- C -/T o 

CL 3 CL O 



um 




um 




a; 
E 




nenl 








ferr 








o 




lact 




tad 




E 




E 
3 




a3 








bact 




bad 




Brevi 


trpB 


> 

CD 


trpA 



UJ 

cr 

CD 

I 

CD 
CL 

cr 



CM 

to 
o 



LU 

cr 

CQ 

<' 

CL 

cr 



£ ° 
E^ 
S - t 

CL CM | 

- O 
CO C/J 



CL 



o 

CO 



O 
o 

UJ 



CL 



o 

CL 



v) i to 



o 
en 
< 



>sCN 

E ' 
2 ° 

CO 173 



co 

LU 

CO 
CL 

I 

LL 
CO 

O 

CL 



X 

o 

CO 
Cl 



O 
CO 
LO 
CO 



CO 

o 



o 
o 



CM | 
CO 



214 

BNSDOCID: <EP 1108790A2_L> 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



40 



o 
c 

=3 



f C (O 

ra id - 



O 

E 
o 
X 



45 



50 



2| 

O Q- 



X w 

Ct. in 

E c 
o o 

o .± 
LL CD 
(D 

E — 

o -tr 

5i 



x 

Q 



o o 



O X 

o 

of 



O 



O 
ZD 

Ql 



E C 

0J TJ 



m 

LU 



O 
Q 
< 



CD 

c < 

? £ 
^ E 

>s O 

o g.E 
to >> « 
3 £ o 

CD Q) O. 

w b a> 
— o 
55 o c 

55 ^£ 
J= to 

III 



o y 

to »— 
-a 
x 

° 6 



a; — 

^ o 

CD CO 



o 

LO 



CO 


CM 


m 


to 


CN 


o 




cO 


CM 


in 


m 


CO 


TT 


rr 




CM 


CM 


CM 


CO 


ro 


CO 



| o 



CO ^ » 



55 





! cv) a | 



in 

CO 



U 
09 
a. 



a> a. 
i= o 

CO CO 



o 



E O 

CD T3 
C CO 



CD 



tfi 



CO 

< 



>- 











<D 




o 




1 ro 












CO 




iS 




a 




-i 




"O 




Q) 








ll 






0) 




in 




ro 


» 


en 




O) 




o 







o 



is?.' 

Hi xz ' 
O <D • 
CO X> | 



CM 



ro 
c 

<u 

CD 



CM 
O 



in | in 



to 

CO 

CO 



CO 
CO 



CO 
ro 



CO 
CO 



p 

3 

o 
a» 

.2 o 
ro 

-Q > 

O DC 
o 

>- CO 

S X 



LU 
i — 
LU 
-J 

i 

o 

Q 
3 
□0 



> 



o 
ro 

en 
a> 

ro 

o E 
ro t 



CO 
CO 



CD 

cn 

£• 

o 

CM 



o 

a 

LU 

i 

or 

a: 
a 
< 
cL 



< o 



< 

O 
O 
< 



< 

a 

Ql 



CM 



OO 
CM 
CD 



CO 
CD 



CO 
CO 



BMSOOCID: <EP t108790A2J_> 



215 



EP 1 108 790 A2 



10 



15 



20 



25 



30 



35 



0) -C — 

— C TO 

CO <v 

:> - 



CD 
C 

o 
o 



ro 



E - 

CO 



o 
X 



o 

o ^ 
CL O 

cn o 
O o. 
— v) 

ST c 

V CD 

O ~ 
<u 
to 
o 

S 5- 



C7> E 

</) CO 



3 

cn 



„ o 
— o 



CO 



CN 

5 



40 



45 



50 



o 

O 
LU 



X 



^ CD 
To " 

2 | 

^1 



o t 
c o 



6 « 



cr 




o 


— > 


E 


dg 






imuri 


K12 


JC 




o 


o 




o 


jro 


TO 




Id 


"53 


o 


c 




o 


G> 


E 




ch 


CO 


CO 


CO 


LU 



s 3 

CD " 
CO CL 



O £ 



CO 

cn 



o 
c 



< 

CO 



e ; 



CO 

^ c 
e — 



in j 

fM i 



CN 



£ 



cn i co 
t i o 
~- ■ O 
CO 



CM 

CO 



o 



a: 

J— 
CO 



LU i=! 
CO 2 



55 



I lu § z 
| oo Z a 



co 
oo 
to 



,_ ! 



CO I ro 



CN 
CO 
CO 



o 
o I 

5 1 



< 



< 

LU 
Q 



O 
to 

CD 



CN 
O 



O 
O 

m 



.5 CN 

^ CN 
U > 

XI ' 

o t: 
o a 
>r > 
2 a: 



o 

CD 
cO 
r~ 

o 

CD 





c 




o 


ca 




c 


E 






o 




Q> 


-D 




o 


X> 




Q- 


a 




o> 


IU 




c 


u 




indi 


OSS 


tein 


-O 


ro 


o 


c 


"ro 


a. 


o 


"aj 
E 


cn 


o 


c 






'c 


no 


? 


<5 


o 


CO 


c 


<L) 


o 


E 


-C 


o 



a 


>- 


cn 








in 


!£ 


JO 




O 


to 


CO 


lO 


CO 


=3 


J3 


O 


6 


CO 


CO 


m 


CD 



"i3 



cn 
CN 



CN 
CO 



$ 




CO 
CD 




CD 




CO 




CN 




CO 




CN 




a 




r-~ 


o 


o 


CO 


lo I CO 



E 
ro 
"5 
o> 
E 



C 

o £ i 
O Q-l 



< 



o 

CO 
CM 



«- CN 

CO ! 

i CO I CO i 



cn 
cn 



co 
co 



CD CD 





r- 


CO 


o 


^3 


go 


T— 


CO 


CO 


CO 


CN 


co 


co 


CD 


co 


CN 


CN 


CN 


CO 


m 


CO 



BNSDOCID: <EP 1 108790A2J_> 



216 



EP1 108 790 A2 



10 



15 



20 



25 



30 



35 



o 
o 



~ C to 
€0 <U 



(2 



£ 5? 

<o o o 

Ql T3 c. 

la <u 
co 

cn £ a> 

J 5S 
* S E 

U 4) V) 

H| ° 

C O C 

o 2 to 

.- a. ^ 



CO 



o 
to 



E 
o 
X 



40 



45 



2- a 

o O 

TO C ^ 

II o 

2 -9 ai c, 

— . *-> i/i 

TO C as to 

■s '5 s ■§ 

c c a) x 



O) 



"o 



F 
o 

CL 



o 



O tO 

1 1 

2 "a 
a. a> 
rn to 

c o 



o 

F *« 

-= </i 

JD TO 

— a> 

£ E 

I r 

Q> TO 



co E 



O 
o 



CD 
O 
LU 



50 



o 
X 

X 

o 

I 



on ^ £L 



55 



C rS < 

m 9 £ 

CO o 



CO 
CN 

n 



o 
o 



co 



CD 



o 
o 



CO 
CO 
CO 



co 
tO 



LU 



JZJ > 

o tr 

>s co 
^ X 



o 

Q. 



E 

CL> 

E 



E 01 

" tr 
to . 

.O > 

o or 

o 

>*co 
5 X 



o 
o 



E o 

CD 



CO 



X > 

-Q > 
o (X 



o 
o 

CD 



— TO 

x S 



2 9 
cu „ 
S 2 

o h- 

°" TO 



CO 
CL 

X 
CO 

a 
or 



T 


CO 


CO 


O 


CN 


CO 






CM 


CD 


CD 


O 


CD 


cn 


o 


OJ 


OJ 


CO 


CO 


CO 


CO 



cn 



to 
cn 
rv 
co 



to 
to 



cr. 

to 



217 



BNSDOCID: <EP 1 10879QA2J_> 



EP 1 108 790 A2 







i 

i 
















t 






f— 








L-aspartate-alpha-decarboxylase 
precursor 










5 
10 




Function 




thioredoxin ch2, M-type 


o 
c 
c 

CO 
<TJ 

-J 

o 
E 

ro 

1 " 

a) ro 

?! 

2 CO 






hypothetical protein 


hypothetical protein 


partitioning or sporulatton protein 

1- — — 


glucose inhibited division proiein 


hypothetical membrane protein 


ribonuclease P protein componei 


50S fibosomal protein 






2-isopropylmalate synthase 


hypothetical protein 


aspartate-semialdehyde 
dehydrogenase 


3-dehydroquinase 


15 




Matched 
length 

(a.a) 




c> 


to 
cn 






CN 
CN 


CD 
CO 


CN 
CM 


CO 

in 


CO 
CO 


CO 
CN 


r— 

NT 






CD 
CO 


I 

CO ] 


to 

QO 


CO 


cn 






milarity 
(%) 




in 

CO 

r-* 


75.4 






58.5 


60.5 


O 
CO 


64.7 


75.4 


59.4 


93.6 






100.0 


100.0 


100.0 


100.0 


o 
o 
o 


20 




CO 












































Identity 

(%) 




42.0 


51.0 






"T 
CO 


37.6 


O 

tn 

CO 


36.0 


rr 


CO 

CD 
CM 


o 

CO 
CO 






100.0 


100.0 


100.0 


100.0 


100.0 


25 
30 


Table 1 (continued) 


Homologous gene 




Chlamydomonas reinhardtii thi2 


Bacillus subtilis cwlB 






Mycobacterium tuberculosis 
H37Rv R\/3916c 


Pseudomonas putida ygt2 


Mycobacterium tuberculosis 
H37Rv parB 


Escherichia coli K12 gidB 


Mycobacterium tuberculosis 
H37Rv Rv3921c 


Bacillus subtilis rnpA 


Mycobacterium avium rpmH 






Corynebacterium glutamicum 
panD 


Corynebacterium glutamicum 
ATCC 13032 leuA 


Corynebacterium glutamicum 
(Brevibacterium flavum) ATCC 
13032 orfX 


Corynebacterium glutamicum 
asd 


Corynebacterium glutamicum 
AS019 aroD 


35 
40 




db Match 




THI2 CHLRE 


ZD 
CO 

o 
< 

CO 

1 

o 






D70851 


YGI2 PSEPU 


ZD 
OL 
LU 
CO 
CL 

1 

O 
> 


GIDB ECOU 


A70852 


RNPA BACSU 


m 1 

QO 

O) 

5 
< 






CO 

LL 

< 


LEU1_CORGL 


YLEU_CORGL 


:DHAS_CORGL 


" i 

OO 

in 

CN 
< 








id 
to 


Cl 
CO 






Q. 


:ds 


:ds 


sp: 


CL 


sp: 


CL 

cn 








ds 


sp 


ds 


o 

O) 






n 


1185 


CN 

CO 


1242 


r» 
r- 
r- 


1041 


CO 

to 


1152 


r- 
co 

CO 


Cl 
CD 
CD 


tn 

CD 


CD 

cn 

CO 


CD 
CO 
CO 


cn 

rM 


CM 
CM 
CN 


CO 
O 

I ^ 


1848 


in 
tn 

CM 


1032 


"T 

rr 


45 




Terminal 
(nt) 


3300119 


3301729 


3302996 


3301989 


tn 
r- 

"<T 

nt 
o 
co 

CT' 


3302999 


3303636 


3304835 


I 3305864 


3306682 


3307S71 


3308412 


rsi 
ro 
o> 
o 

CO 

ro 


3308822 


147573 


266154 


268814 


271691 


CN 

tn 

CO 
•nT 
NT 


50 




Initial 
(nt) 


co 
c 
co 

c- 
c 


CO 

tn 
c 

C 
c 


3301755 


tn 
cr: 
r- 

CN 

c 
r* 
c 


3303435 


3303616 


3304787 


3305671 


CN 
CO 

m 
to 
c 

CO 

c 


3307632 


3308369 


3308747 


3309026 


CO 
"3" 
O 

CT. 

c 

CO 


147980 


268001 


269068 


270650 


446075 






S§ ^ 


- CC 

s 


> CT 

> a 
) tc 


6920 


rv 
a 

CC 


CN 
J CN 

> c 
) cc 


6923 


CN 
C? 

to 


6925 


tc 

IN 

C 
CC 


6927 


CC 

O 

a 
cc 


tr 
Cn 
CT 
> CC 


o 
I ro 

cn 
) X 


c*- 
cr 
cc 


6932 


6933 


6934 


6935 


6936 


55 




Sol 
to z: L 


r a 


j a 

""3 

5 r 


3420 


O 

r 


- rv 

J CN 

r -ri 

-> c 


i ! CO 
J CM 
T 
CO 


(N 

c 


3425 


CC 

o 
r 


3427 


CC 

CN 

r 


) CT 

I CN 


) c 
t c 

NJ 

> r 


> 

> r- 


3432 


3433 


CO 

NT 

CO 


tn 

CO 

NT 

CO 


CO 
CO 

NT 

CO 















218 

BNSDOCID: <EP 1108790A2_L> 



EP 1 108 790 A2 



5 
10 


Function 


elongation factor Tu 


preprotein translocase secY subuit 


— — . — j 

isocitrate dehydrogenase | 
(oxalosuccinatedecarboxvlese) 1 


acyl-CoA carboxylase or biotin- 
binding protein 


citrate synthase 


putative binding protein or peptidyl- 
prolyl cis-trans isomerase 


glycine betaine transporter 


1 — _ 

hypothetical membrane protein 


L-lysine permease 


aromatic amino acid permease 


hypothetical protein 


succinyl diaminopimelate 
desuccinylase 


proline transport system 


arginyMRNA synthetase 


15 


Matched 
length 
(aa) 


CO 

cn 

CO 


o 


CO 
CO 


cn 
m 


CO 

rr 


CO 


m 
cn 
in 


CO 
CM 


o 
in 


co 

CD 
rr 


CO 
CO 


CO 
CO 
CO 


T 
CN 

m 


o 

cn 
cn 


20 


Similarity 
(%) 


100.0 


100 0 


100 0 


1000 


100.0 


1000 


100 0 


„ 

100.0 


100.0 


100.0 


100.0 


100.0 


100 0 


100.0 




Identity 
(%) 


100.O 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100 0 


100 0 


25 ^ 

■o 

CD 
3 

_C 

C 

O 

35 
40 


Homologous gene 


Corynebacterium glutamicum 
ATCC 13059 luf 


Corynebacterium glutamicum 
(Brevibacterium flavum) MJ233 
secY 


Corynebacterium glutamicum 
ATCC 13032 icd 


Corynebacterium glutamicum 
ATCC 13032 accBC 


Corynebacterium glutamicum 
ATCC 13032 gltA 


Corynebacterium glutamicum 
ATCC 13032 fkbA 


Corynebacterium glutamicum 
ATCC 13032 betP 


Corynebacterium glutamicum 
ATCC 13032 ort2 


Corynebacterium glutamicum 
ATCC 13032 lysl 


Corynebacterium glutamicum 
ATCC 13032 aroP 


Corynebacterium glutamicum 
ATCC 13032 orf3 


Corynebacterium glutamicum 
ATCC 13032 dapE 


Corynebacterium glutamicum 
ATCC 13032 putP 


Corynebacterium glutamicum 
AS019 ATCC 13059 argS 


db Watch 


i 

o 

or 
O 
o 

i 

ZD 
h- 
li. 
LU 
Cl 
i/i 


6 

cr 
O 

(J 

1 

>- 

U ! 
U J 
CO 

tn 


_j 
O 
cn 
o 
cj 

1 

X 
Q 

cL 
tn 


< 

co 

CO 
CM 
CM 
CM 

Cl. 


_j 

e? 

or 
O 

CJ 

>' 

CO 

o 

CL 


_j 
O 
or 
O 
o 

i 

Q_ 
CO 

u_ 

cL 
u> 


i 

ID 
or 
O 

CJ 

l 

CL 
b- 
LU 
CO 
Cl 

CO 


_j 

(D 

or 
o 
o 

1 

CM 

_J 

>- 

Cl 
lo 


— > 
O 
cr 
o 
o 

\ 

CO 

> 

! 

CL 

tn 


_i 

O 
or 
O 

CJ 

1 

0_ 

O 
or 
< 

CL 
CO 


CO 

tn 

CM 
LO 

CO 
CL 


< 
o 

CO 
CO 

o 

CN 
CL 


gp:CGPUTP_1 


_j 

cr 
o 

CJ 

or 

> 

CO 
Cx 

cn 




Li_ — 

St 


CD 
CO 


O 

CM 

CO 


2214 


1773 


CO 


rT 

m 

CO 


tn 

CO 


1278 


1503 


1389 


CO 

rr 
cn 


1107 


1572 


1650 


45 


Terminal 
(nt) 


527563 


570771 


677831 


718580 


879148 


879629 


o 

OO 

f— 
CO 

rr 
cn 


1029006 


10303G9 


cn 

m 

CM 
CO 

m 


cn 

CM 

r*. 
rr 
m 


CO 

CO 
CD 
LO 


1218031 


1239923 


50 


Iniiiai 
(nt) 


526376 


CM 

in 
rr 
cn 

CO 

*n 


O 
O 
CO 
CO 


720352 


877838 


879276 


to 
cn 
cn 
rr 

T 

cn 


1030283 


1031871 


co 

CO 
CO 
rr 
Lf> 


1155676 


CO 

r- 
m 
cn 


1219602 


CM 
CO 
CO 

cv 




So « 
to ^ e 


6937 


1 6938 


6939 


6940 


6941 


! 6942 


6943 


rr 
T 
CD 
CD 


m 
rr 
cn 

CD | 


CD 
rf 
CD 
CD 


^3 

o> 

CO 


CO I 

rr 
cs 

CD j 


cn 

T 

cn 

CO 


Q 
iTj 
CD 
CO 


55 


u| 9 £ 

CO ^ o 


3437 


3438 


cn 
ro 

co 


o 
co 


rr 
rr 

CO 


CN 

? i 

CO J_ 


co 1 


rr 
CO 


* ! 

CO 


CD I 
v ! 
ri 

co • 


r-» 
rr 

T 

CO 


5 | 


3449 j 


o 

IT) 

rr 
CO 



BNSDOClD:<EP 1 108 790 A2 I > 



219 



EP 1 108 790 A2 



g 

u 
c 



10 



15 



o 

^ -D 

a> -EL a; 
ro co 



0) 



O O o 

E 2 I 



20 



25 



CO 



30 



35 



40 



45 



50 



55 



-o 



o 



,1X3 



O 

o 



3 

o 



CD 

_ o 

E CO 

5< 



O CO 

o < 



O 

O 
o 

<■ 

Q 
O 
Q 



o 

E 
o 



cd 

CD 

£ ro 

& o 

O CO 

u < 



to 



o 



o 
o 



o 



o 

X 
Q 



CO £ 
CD ID 

§8 

IS 

h- 
< 

O CO 

o < 



TO 



o 

or 
O 
u 

i 

LU 
CO 
X 



CD 

E 



ra co 
.a t 



cT£ 

O T- 

o or 



ro 

c ^ 
E c 



CO ^ 



CO 
CO 



Sol'; 5 
co ^ a I co 



o ■«- 
u tr 



si 



OJ 

E 
o 



CO 



E 

o <«- 

o tr 



o 
cr 
O 
o 

i 

O 

CO 

>- 



CD 



o I— 
O < 



II 



ra 



ro 



J3 __ 
<u 

^ O 

o h- 
U < 



CO 
CN 



CO 
T 
CM 
CO 



in 
T 

ro 



to 
CD 



o 
or 
o 
a 

i 

m 

> 



3 

CO 



a) 

c^O 
o 
O < 



O 



co 
o 
o 
o 



o 

T 

CO 

o 



XI 

o 
Cu 

<u ro 

• s « 

a. ^ o 

10 m 3 



Q_ O 



t/1 



ro 



_ o a> 

« ~ £ 

cfl ^ C 

a 3 § 



O 

o 



5 <° 

CJ 

CO 

a h- 

CJ < 



0 
tr 
O 
o 

I 

ro 
LU 



E 

Z3 



ii 

CD CL 

Tj co 
co 

O) 

cM- 
o O 
O * 



o 
o 



< 

CD 



O 
CM 



t m 
E 2 1 
ro 

(U CM 

2 S2 
a < 



o 

CY 

o 

o 

I 

CD 

O 

< 



o. 



E 
n 

y 

E 
ro 

CD 

E 2 1 

O CM 
o S 

CU 

^° 

o h- 
O < 



O 
cr 
O 
o 

< 

CJ 



CO 
CM 
IT) 



CD I CD 



T) CD 

ro ^ 

CJ CD 
C 

O CO 

o < 



BNSCXXJID: <EP 1108790A2J_> 



220 



EP 1 108 790 A2 



w 



15 



20 



25 



30 



35 



40 



50 



55 



T3 



O 

E 
o 
X 



x> 



Q 
< 



" o 
£ « 

CO 

o f— 
O < 



o 



LL 



o 



E 

a; 



it 

££ 



o 
o 



C7) 
| 

B ui 

a vi 
to ■= 

o> o> 
c t- 
cTO 

O CO 

u < 



CD 
00 
O 
LL 
< 
CL 



O 

o 



a> cm 

c O 

o h- 
o < 



u 



o 
o 



XL 
o 

Q. 



t: a» 

Q_ 4/1 



O 

o 



II 

o 



aj 

c o 

£TO 
o h- 
O < 



o 
o 



ChfT) 



c O 

^o 

c r- 
o < 



o 



2o 

CO ^ 



ro 

CD 

ED 

a3 cn 

0) *~" 

CO I 

cro 
o < 1 



zi<7 
2 E 

c — 

>^£ O 

tO w </l 

3 o 

.!? a"? 

C - (fl 

o o o 

_c c -c 

U 03 CL 



2 o 



CD 



cn 



O CO 

o < 



cn 
to 



C3 ^ < ; 



£ 

o 
E 

TO 

a> cn 

£2 

a> 

co 

o h- 
o < 



o 

a. 

< 



5 c 
,2 o 



as to 



o 
o 



CD 



a 
E 

CD 

cn 

£ O) 

QJ O 

^1 

a> *~ 
CO 

o K 

O < 



Q 
CO 
CO 



E 

;_J 
o 

£ 

TO 

"cn ^ 
E ™ 

a3 cm 

CD *~ 

CO 
^O 

O h— 

o < 



c 

~o 
a 
a. 
*o 
o 

"O 
-C 



O 
O 



O 

o 



cn 
£ 

cj 



O CO 

o < 



cr 

o 
o 

I 

CO 
ZD 

C9 



CD 
CO 
cn 



CD 
CD 



CM I — 



CM 
O 
fNJ 



s 



cn 
CO 



o 

CO 



O 

q: 
o 
o 

<• 

o 

CD 

cr 

Ql 



CO 



o 



o 

Cl 



If 

ro a) 

c — cL 

J |-S 

o 

(T3 

03 XJ 

c > a 

o O 

O ™. < 



E 



ai <0 
cO 



cr 

CO 

<' 

a 
< 
Q 



CJ 

I £ 

cd r- cn 

5 ^^2 



; o 



^ "> - 
CT 03 O 

o S° < 



o 
o 

1 

GQ 
CL 
< 
Q 



u 

oj E 

c ^ 

o cr 



o 



co 
o 



CO 
CD 



J 



BNSLXXID: <EP._ 11 08 790 A2 I > 



221 



EP 1 108 790 A2 



a> 



10 



15 



20 



25 



30 



35 



40 



45 



1 % 



S 5 

^= o> 

^ E 

^ CD 



aj _c _ — 

CO ^0 

^ c «j 

03 O *~ ' 



iS Co 

I- 

CO 



CD 
CO 



CD 

o 



o 

CD 

_o 

O 

E 



e9 

<U CN 

o h- 

a < 



c 



a 

Q 
< 



-o 

x: 



5 I 



o 
o 



E 

TO 
CO 

E 

QJ (N 

to 

^° 
cT O 
o h- 
O < 



< 

a 

CL 

co 



O — CN 



50 



55 



co 
to 
co 
cn 

CD 



C0Q_ 

II 

qj cn 
co 

<D 

c o 

O H- 

o < 



cn < 

I- 



O H 

O < 



cn 
E 

3 



TO >■» 



E £ 

o CO 
CJ < 



< 

u 



< 



TO ~ I 



CO ^ D 



O 
cn 
O 
o 

*' 

>- 

CL 



S 52 

GJ 

c o 

o f- 
O < 



o 

CO 
CN 
CD 



QJ 

c O 
cTO 

o (» 

O < 



o 
o 
o 



o 
o 



E 

TO 
CO 



to 



o 

CO 



CN 
CM 
CO 
CM 

t: 

Cl 



O 
or 
O 
o 

i 

o 
ce 

x 



1% 

a> cn 

c o 

o h- 

O < 



CD 

C) 



O 
LD 

CM 



C P. 



cu T— 

CO 

o h- 
O < 



o 
o 



o 

c 
o 



co «- 



CO <£ 

c <u 

C CJ 

QJ CN 

QJ 

o h- 

O < 



E 2 



o h- 

o < 



o 

cr 
O 
o 

i 

CD 
O 

cn 

Cl 



0) cd 
TO c 

Q) CD 

C T- 

o CO 
O < 



Li. 

< 



E _ 

.2 c 

Q> CN : 
Q> i 

c o ; 

o \— 

L> < 



.2 c 

CU CM 

cu 

CO 

o [ 
O < 



I 



m 

CN 



CN | 



O 

CD 



222 

BNSDOCID: <EP 1 108790A2_L> 



EP 1 108 



790 A2 











iel forming 








ory 






5 






Q 




erase 


'otein or 
or 


se regulat 


prephenate dehydratase 


protein 


10 




Functtor 


meso-diaminopimelate 
dehydrogenase 


porin or cell wall chanr 
protein 


acetate kinase 


phosphate acetyltransl 


multidrug resistance pi 
macrolide-efflux pump 
drug:proton antiporter 


ATP-dependent protea 
subunit 


ectoine/proline uptake 


15 




Matched 
length 

(a. a) 


o 
Cm 
co 


u~> 


o> 

CO 


cn 

CM 

CO 


cn 
in 

-«r 


CM 

m 

CO 


m 

CO 


o 
m 




























Similari 
(%) 


O 


O 


q 


o 


o 


o 


o 


o 


20 




o 
o 


O 
O 


d 
o 


o 
o 


o 
o 


o 
o 


o 
o 


d 
o 








O 


O 


o 


o 


o 


o 


o 


o 






ii 


o 
o 


o 
o 


o 
o 


o 
o 


o 
o 


o 
o 


o 
o 


d 
o 


25 
30 


ble 1 (continued) 


Homologous gene 


terium glutamicum 
dh " , 


terium glutamicum 
porA 


Corynebscterium glutamicum 
ATCC 13032 ackA 


Corynebecterium glutamicum 
ATCC 13032 pta 


Corynebacterium glutamicum 
ATCC 13032 cmr 


Corynebacterium glutamicum 
ATCC 13032 cIpB 


erium glutamicum 


Corynebacterium glutamicum 
ATCC 13032 proP 


35 


Tal 


Corynebac 
KY10755 d 


Corynebac 
MH20-22B 


Corynebacl 
pheA 






















40 




db Match 


-j 
O 
or 

o 
a 

i 

X 

Q i 
C ; 


CO 

o 

CO 
CO 
CM 
_J 

O 
u 


i 

O 
o: 

O 
u 

i 

3 

o 
< 


2516394A 


2309322A 


CLP3_C0RGL 


12 10266 A 


2501295A 








CL ! 

tn 


CL 

cn 


CL 

in 


CL 


prf; 


CL 

tn 


prf 


prf; 




! 


ii 


O 

CO 

cn 


m 

CO 


1191 


CO 

cn 


1377 


2556 


m 
cn 


1512 


45 




Terminal 
(nt) 


2736756 


*r 

cn 
r^- 
ro 
CO 
CM 


2935315 


2936508 


2962718 


2953606 


3098578 


3272563 


50 




Initial 
(nt) 


2787715 


CO 

CO 
CO 
CM 


2936505 


rr 
cn 

r^- 
CO 
cn 

CM 


CM 
CO 
CO 

cn 

CM 


2966161 


3099522 


3274074 






SEQ 

NO. 
(a.a ) 


T 

o> 
cn 
to 


5995 


6996 


5997 


6998 


5669 


7000 


7001 


55 




SEQ 
NO 
(DNA) 


s i 

" i 


in | 
cn 

ro 


3496 


cn 

ro 


3498 


3499 


3500 


3501 



BNSDOCJD: <EP _11C8790A2J_> 



223 



EP 1 108 790 A2 



Example 2 

Determination of effective mutation site 

s (1 ) Identification of mutation site based on the comparison of the gene nucleotide sequence of lysine-producing B-6 
strain with that of wild type strain ATCC 13032 

[0374] Corynebacterium glutamicum B-6, which is resistant to S-(2-aminoethyl)cysteine (AEC), rifampicin, strepto- 
mycin and 6-azauracil, is a lysine-producing mutant having been mutated and bred by subjecting the wild type ATCC 

10 13032 strain to multiple rounds of random mutagenesis with a mutagen, N-methyl-N" -nitro-N-nitrosoguanidine (NTG) 
and screening (Appl. Microbiol. Biotechnol., 32: 269-273 (1989)). First, the nucleotide sequences of genes derived 
from the B-6 strain and considered to relate to the lysine production were determined by a method similar to the above. 
The genes relating to the lysine production include lysE and lysG which are lysine-excreting genes; ddh, dapA, horn 
and lysC (encoding diaminopimelate dehydrogenase, dihydropicolinate synthase, homoserine dehydrogenase and 

is aspartokinase respectively) which are lysine-biosynthetic genes; and pyc and zwf (encoding pyruvate carboxylase 
and glucose-6-phosphate dehydrogenase, respectively) which are glucose-metabolizing genes. The nucleotide se- 
quences of the genes derived from the production strain were compared with the corresponding nucleotide sequences 
of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and analyzed. As a result, mutation points 
were observed in many genes. For example, no mutation site was observed in lysE, lysG, ddh, dapA, and the like, 

20 whereas amino acid replacement mutations were found in horn, lysC, pyc, zwf, and the like. Among these mutation 
points those which are considered to contribute to the production were extracted on the basis of known biochemical 
or genetic information. Among the mutation points thus extracted, a mutation, Val59Ala, in horn and a mutation, 
Pro458Ser, in pyc were evaluated whether or not the mutations were effective according to the following method. 

2s (2) Evaluation of mutation, Val59Ala, in horn and mutation, Pro458Ser, in pyc 

[0375] It is known that a mutation in horn inducing requirement or partial requirement for homoserine imparts lysine 
productivity to a wild type strain (Amino Acid Fermentation, ed. by Hiroshi Aida ef at.. Japan Scientific Soc.et.es Press). 
However the relationship between the mutation, Val59Ala, in horn and lysine production is not known. It can be ex- 
30 amined whether or not the mutation, Val59Ala, in horn is an effective mutation by introducing the mutation to the wild 
type strain and examining the lysine productivity of the resulting strain. On the other hand, it can be examined whether 
or not the mutation, Pro458Ser, in pyc is effective by introducing this mutation into a lysine-producing strain which has 
a deregulated lysine-bioxynthetic pathway and is free from the pyc mutation, and comparing the lysine productivity of 
the resulting strain with the parent strain. As such a lysine-producing bacterium, No. 58 strain (FERM BP-7134) was 
35 selected (hereinafter referred to the "lysine-producing No. 58 strain" or the "No. 58 strain"). Based on the above, it was 
determined that the mutation, Val59Ala, in horn and the mutation, Pro458Ser, in pyc were introduced into the wild type 
strain of Corynebacterium glutamicum ATCC 1 3032 (hereinafter referred to as the "wild type ATCC 1 3032 strain" or 
the "ATCC 13032 strain") and the lysine-producing No. 58 strain, respectively, using the gene replacement method. A 
plasmid vector pCES30 for the gene replacement for the introduction was constructed by the following method. 
40 [0376] A plasmid vector pCE53 having a kanamycin-resistant gene and being capable of autonomously replicating 
in Coryneform bacteria (/Wo/. Gen. Genet, 196: 175-178 (1984)) and a plasmid pMOB3 (ATCC 77282) containing a 
levansucrase gene (sacB) of Bacillus subtilis {Molecular Microbiology, 6: 1195-1204 (1992)) were each digested with 
Psfl Then after agarose gel electrophoresis, a pCE53 fragment and a 2.6 kb DNA fragment containing sacB were 
each extracted and purified using GENECLEAN Kit (manufactured by BIO 101). The P CE53 fragment and the 2.6 kb 
45 DNA fragment were ligated using Ligation Kit ver. 2 (manufactured by Takara Shuzo), introduced into the ATCC 13032 
strain by the electroporation method (FEMS Microbiology Letters, 65: 299 (1 989)), and cultured on BYG agar medium 
(medium prepared by adding 1 0 g of glucose, 20 g of peptone (manufactured by Kyokuto Pharmaceutical), 5 g of yeast 
extract (manufactured by Difco), and 16 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjust.ng its 
pH to 7 2) containing 25 ng/ml kanamycin at 30°C for 2 days to obtain a transformant acquiring kanamycin-res.stance. 
so as a result of digestion analysis with restriction enzymes, it was confirmed that a plasmid extracted from the resulting 
transformant by the alkali SDS method had a structure in which the 2.6 kb DNA fragment had been inserted into the 
Psfl site of pCE53. This plasmid was named pCES30. 

[0377] Next, two genes having a mutation point, horn and pyc, were amplified by PCR, and inserted into pCES30 
according to the TA cloning method (Bio Experiment Illustrated vol. 3, published by Shujunsha). Specifically, pCES30 
55 was digested with BamHI (manufactured by Takara Shuzo), subjected to an agarose gel electrophoresis, and extracted 
and purified using GENECLEAN Kit (manufactured by BIO 1 01 ). The both ends of the resulting pCES30 fragment were 
blunted with DNA Blunting Kit (manufactured by Takara Shuzo) according to the attached protocol. The blunt-ended 
pCES30 fragment was concentrated by extraction with phenol/chloroform and precipitation with ethanol, and allowed 



224 



BNSOOCID: <EP 1 108790A2_I_> 



EP1 108 790 A2 



w 



15 



to react in the presence of Taq polymerase (manufactured by Roche Diagnostics) and dTTP at 70°C for 2 hours so 
that a nucleotide, thymine (T), was added to the 3*-end to prepare a T vector of pCES30. 

[0378] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the method 
of Saito et al. (Biochem. Biophys. Acta, 72. 619 (1 963)). Using the chromosomal DNA as a template, PCR was carried 
out with Pfu turbo DNA polymelase (manufactured by Stratagene). In the mutated horn gene, the DNAs having the 
nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were used as the primer set. In the mutated pyc 
gene, the DNAs having the nucleotide sequences represented by SEQ ID NOS:7004 and 7005 were used as the primer 
set. The resulting PCR product was subjected to agarose gel electrophoresis, and extracted and purified using GENE- 
GLEAN Kit (manufactured by BIO 1 01 ). Then, the PCR product was allowed to react in the presence of Taq polymerase 
(manufactured by Roche Diagnostics) and dATP at 72°C for 10 minutes so that a nucleotide, adenine (A), was added 
to the 3'-end. 

[0379] The above pCES30 T vector fragment and the mutated horn gene (1 .7 kb) or mutated pyc gene (3.6 kb) to 
which the nucleotide A had been added of the PCR product were concentrated by extraction with phenol/chloroform 
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the 
ATCC 13032 strain according to the e I ectrop oration method, and cultured on BYG agar medium containing 25 ug/ml 
kanamycin at 30°C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 ug/ml kanamycin, andaplasmid was extracted from the cult u ring 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was 
confirmed that the plasmid had a structure in which the 1 .7 kb or 3.6 kb DNA fragmenl had been inserted into pCES30. 
The plasmids thus constructed were named respectively pChom59 and pCpyc458. 

[0380] The introduction of the mutations to the wild type ATCC 13032 strain and the lysine-producing No. 58 strain 
according to the gene replacement method was carried out according to the following method. Specifically. pChom59 
and pCpyc458 were introduced to the ATCC 13032 strain and the No. 58 strain, respectively, and strains in which the 
plasmid is integrated into the chromosomal DNA by homologous recombination were selected using the method of 
Ikeda et al. (Microbiology 144: 1863 (1998)). Then, the stains in which the second homologous recombination was 
carried out were selected by a selection method, making use of the fact that the Bacillus subtilis levansucrase encoded 
by pCES30 produced a suicidal substance (J. of Bacterid, 174: 5462 (1992)). Among the selected strains, strains in 
which the wild type horn and pyc genes possessed by the ATCC 1 3032 strain and the No. 58 strain were replaced with 
the mutated horn and pyogenes., respectively, were isolated. The method is specifically explained below. 
[0381] One strain was selected from the transformants containing the plasmid, pChom59 or pCpyc458, and the 
selected strain was cultured in BYG medium containing 20 [ig/m\ kanamycin, and pCG11 (Japanese Published Exam- 
ined Patent Application No. 91827/94) was introduced thereinto by the electroporation method. pCG11 is a plasmid 
vector having a spectinomycin-resistant gene and a replication origin which is the same as pCE53. After introduction 
of the pCGII, the strain was cultured on BYG agar medium containing 20 jxg/ml kanamycin and 1 00 \ig/m\ spectinomycin 
at 30°C for 2 days to obtain both the kanamycin- and spectinomycin-resistant transformant. The chromosome of one 
strain of these transformants was examined by the Southern blotting hybridization according to the method reported 
by Ikeda et al. (Microbiology 144: 1863 (1998)). As a result, it was confirmed that pChom59 or pCpyc458 had been 
integrated into the chromosome by the homologous recombination of the Cambell type. In such a strain, the wild type 
and mutated horn or pyc genes are present closely on the chromosome, and the second homologous recombination 
*o is liable to arise therebetween. 

[0382] Each of these transformants (having been recombined once) was spread on Sue agar medium (medium 
prepared by adding 1 00 g of sucrose, 7 g of meat extract, 1 0 g of peptone, 3 g of sodium chloride, 5 g of yeast extract 
(manufactured by Difco), and 18 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its pH 7.2) 
and cultured at 30°C for a day. Then the colonies thus growing were selected in each case. Since a strain in which the 
45 sacB gene is present converts sucrose into a suicide substrate, it cannot grow in this medium (J. BacterioL, 1 74: 5462 
(1 992)). On the other hand, a strain in which the sacB gene was deleted due to the second homologous recombination 
between the wild type and the mutated horn or pyc genes positioned closely to each other forms no suicide substrate 
and, therefore, can grow in this medium. In the homologous recombination, either the wild type gene or the mutated 
gene is deleted together with the sacB gene. When the wild type is deleted together with the sacB gene, the gene 
so replacement into the mutated type arises. 

[0383] Chromosomal DNA of each the thus obtained second recombinants was prepared by the above method of 
Saito et al. PCR was carried out using Pfu turbo DNA polymerase (manufactured by Stratagene) and the attached 
buffer. In the horn gene, DNAs having the nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were 
used as the primer set. Also, in the pyc gene was used, DNAs having the nucleotide sequences represented by SEQ 
55 id NOS:7004 and 7005 were used as the primer set. The nucleotide sequences of the PCR products were determined 
by the conventional method so that it was judged whether the horn or pyc gene of the second recombinant was a wild 
type or a mutant. As a result, the second recombinant which were called HD- 1 and No. 58pyc were target strains having 
the mutated horn gene and pyc gene, respectively. 
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(3) Lysine production test of HD-1 and No. 58pyc strains 

[0384] The HD-1 strain (strain obtained by incorporating the mutation, Val59Ala, in the horn gene into the ATCC 
1 3032 strain) and the No. 58pyc strain (strain obtained by incorporating the mutation, Pro458Ser, in the pyc gene into 
the lysine-producing No. 58 strain) were subjected to a culture test in a 5 I jar fermenter by us.ng the ATCC 13032 
strain and the lysine-producing No. 58 strain respectively as a control. Thus lysine production was examined. 
[0385] After culturing on BYG agar medium at 30°C for 24 hours, each strain was inoculated into 250 ml of a seed 
medium (medium prepared by adding 50 g of sucrose, 40 g of corn steep liquor, 8.3 g of ammonium sulfate, 1 g of 
urea 2 g of potassium dihydrogenphosphate, 0.83 g of magnesium sulfate heptahydrate, 10 mg of iron sulfate hep- 
tahydrate 1 mg of copper sulfate pentahydrate, 1 0mg of zinc sulfate heptahydrate, 10 mg of p-alanine, 5 mg of nicotinic 
acid 1 5 mg of thiamin hydrochloride, and 0.5 mg of biotin to 1 liter of water, and adjusting its P H to 7.2, then to which 
30 g of calcium carbonate had been added) contained in a 2 1 buffle-attached Erlenmeyer flask and cultured therein 
at 30°C for 1 2 to 1 6 hours A total amount of the seed culturing medium was inoculated into 1 ,400 ml of a main culture 
medium (medium prepared by adding 60 g of glucose, 20 g of corn steep liquor, 25 g of ammonium chloride, 2.5 g of 
potassium dihydrogenphosphate, 0.75 g of magnesium sulfate heptahydrate, 50 mg of iron sulfate heptahydrate, 13 
mg of manganese sulfate pentahydrate, 50 mg of calcium chloride, 6.3 mg of copper sulfate pentahydrate, 1 .3 mg of 
zinc sulfate heptahydrate, 5 mg of nickel chloride hexahydrate, 1 .3 mg of cobalt chloride hexahydrate, 1 .3 mg of am- 
monium molybdenate tetrahydrate, 14 mg of nicotinic acid, 23 mg of p-alanine, 7 mg of thiamin hydrochloride, and 
0 42 mg of biotin to 1 liter of water) contained in a 5 1 jar fermenter and cultured therein at 32»C. 1 vvm and 800 rpm 
while controlling the pH to 7.0 with aqueous ammonia. When glucose in the medium had been consumed, a glucose 
feeding solution (medium prepared by adding 400 g glucose and 45 g of ammonium chloride to 1 hter of water) was 
continuously added. The addition of feeding solution was carried out at a controlled speed so as to maintain the dis- 
solved oxygen concentration within a range of 0.5 to 3 ppm. After culturing for 29 hours, the culture was terminated. 
The cells were separated from the culture medium by centrifugation and then L-lysine hydrochloride in the supernatant 
was quantified by high performance liquid chromatography (HPLC). The results are shown in Table 2 below. 
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Table 2 


Strain 


L-Lysine hydrochloride yield (g/l) 


ATCC 13032 


0 


HD-1 


8 


No. 58 


45 


No. 58pyc 


51 



[0386] As is apparent from the results shown in Table 2, the lysine productivity was improved by introducing the 
mutation Val59Ala, in the horn gene or the mutation, Pro458Ser, in the pyc gene. Accordingly, it was found that the 
mutations are both effective mutations relating to the production of lysine. Strain, AHP-3, in which the mutation, 
Val59Ala in the homgene and the mutation, Pro458Ser, in the pycgene have been introduced into the wild type ATCC 
13032 strain together with the mutation, Thr331lle in the lysC gene has been deposited on December 5, 2000, in 
National Institute of Bioscience and Human Technology, Agency of Industrial Science and Technology (Higashi 1-1-3, 
Tsukuba-shi, Ibaraki, Japan) as FERM BP-7382. 
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Example 3 

Reconstruction of lysine-producing strain based on genome information 

[0387] The lysine-producing mutant B-6 strain (Appl. Microbiol. Biotechnol., 32. 269-273 (1989)), which has been 
constructed by multiple round random mutagenesis with NTG and screening from the wild type ATCC 13032 strain, 
produces a remarkably large amount of lysine hydrochloride when cultured in ajar at 32°C using glucose as a carbon 
source However, since the fermentation period is long, the production rate is less than 2.1 g/l/h. Breeding to reconstitute 
only effective mutations relating to the production ol lysine among the estimated at least 300 mutations introduced into 
the B-6 strain in the wild type ATCC 13032 strain was performed. 

(1) Identification of mutation point and effective mutation by comparing the gene nucleotide sequence of the B-6 strain 
55 with that of the ATCC 1 3032 strain 

[0388] As described above, the nucleotide sequences of genes derived from the B-6 strain were compared with the 



50 



226 

BNSDOCID: <EP 1108790A2_I_> 



EP 1 108 790 A2 

corresponding nucleotide sequences of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and 
analyzed to identify many mutation points accumulated in the chromosome of the B-6 strain. Among these, a mutation 
Val591Ala, in horn, a mutation, Thr311lle, in lysC, a mutation, Pro458Ser f in pyc and a mutation. Ala213Thr in zwf 
were specified as effective mutations relating to the production of lysine. Breeding to reconstitute the 4 mutations in 
the wild type strain and for constructing of an industrially important lysine-producing strain was carried out accordinq 
to the method shown below. 

(2) Construction of plasmid for gene replacement having mutated gene 

[0389] The plasmid for gene replacement, P Chom59, having the mutated horn gene and the plasmid for gene re- 
placement, pCpyc458, having the mutated pyc gene were prepared in the above Example 2(2). Plasmids for gene 
replacement having the mutated lysC and zwf were produced as described below. 

[0390] The lysC and zwf having mutation points were amplified by PCR, and inserted into a plasmid for gene re- 
placement, pCES30, according to the TA cloning method described in Example 2(2) (Bio Experiment Illustrated. Vol. 3). 
[0391] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the above 
method of Saito et ai Using the chromosomal DNA as a template, PCR was carried out with Pf u turbo DNA polymerase 
(manufactured by Stratagene). In the mutated lysC gene, the DNAs having the nucleotide sequences represented by 
SEQ ID NOS:7006 and 7007 were used as the primer set. In the mutated zwf gene, the DNAs having the nucleotide 
sequences represented by SEQ ID NOS:7008 and 7009 as the primer set. The resulting PCR product was subjected 
to agarose gel electrophoresis, and extracted and purified using GENEGLEAN Kit (manufactured by BIO 101). Then 
the PCR product was allowed to react in the presence of Taq DNA polymerase (manufactured by Roche Diagnostics) 
and dATP at 72°C for 10 minutes so that a nucleotide, adenine (A), was added to the 3'-end. 

[0392] The above pCES30 T vector fragment and the mutated lysC gene (1 .5 kb) or mutated zwf gene (2.3 kb) to 
which the nucleotide A had been added of the PCR product were concentrated by extraction with phenol/chloroform 
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the 
ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 u.g/ml 
kanamycin at 30°C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 ug/ml kanamycin, and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes it was 
confirmed that the plasmid had a structure in which the 1 .5 kb or 2.3 kb DNA fragment had been inserted into pCES30. 
The plasmids thus constructed were named respectively pClysC311 and pCzwf213. 

(3) Introduction of mutation, Thr311 lie, in lysC into one point mutant HD-1 

35 [0393] Since the one mutation point mutant HD-1 in which the mutation, Val59A!a. in horn was introduced into the 
wild type ATCC 1 3032 strain had been obtained in Example 2(2), the mutation, Thr311 lie, in lysC was introduced into 
the HD-1 strain using P ClysC311 produced in the above (2) according to the gene replacement method described in 
Example 2(2). PCR was carried out using chromosomal DNA of the resulting strain and, as the primer set DNAs having 
the nucleotide sequences represented by SEQ ID NOS:7006 and 7007 in the same manner as in Example 2(2) As a 

40 result of the fact that the nucleotide sequence of the PCR product was determined in the usual manner, it was confirmed 
that the strain which was named AHD-2 was a two point mutant having the mutated lysC gene in addition to the mutated 
horn gene. 

(4) Introduction of mutation, Pro458Ser, in pyc into two point mutant AHD-2 

45 

[0394] The mutation, Pro458Ser, in pyc was introduced into the AHD-2 strain using the pCpyc458 produced in Ex- 
ample 2(2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal 
DNA of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID 
NOS:7004 and 7005 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of 
the PCR product was determined in the usual manner, it was confirmed that the strain which was named AHD-3 was 
a three point mutant having the mutated pyc gene in addition to the mutated horn gene and lysC gene. 

(5) Introduction of mutation, Ala213Thr, in zwf'mXo three point mutant AHP-3 

55 [0395] The mutation, Ala213Thr, in zwf was introduced into the AHP-3 strain using the pCzwf458 produced in the 
above (2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal DNA 
of the resulting strain and, as the primer set : DNAs having the nucleotide sequences represented by SEQ ID NOS- 
7008 and 7009 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of the PCR 
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product was determined in the usual manner, it was confirmed that the strain which was named APZ-4 was a four point 
mutant having the mutated zwf gene in addition to the mutated horn gene, lysC gene and pyc gene. 

(6) Lysine production test on HD-1 , AHD-2, AHP-3 and APZ-4 strains 

[0396] The HD-1 , AHD-2, AHP-3 and APZ-4 strains obtained above were subjected to a culture test in a 5 I jar 
fermenter in accordance with the method of Example 2(3). 
[0397] Table 3 shows the results. 

Table 3 
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Strain 


L-Lysine hydrochloride (g/l) 


Productivity (g/l/h) 


HD-1 


8 


0.3 


AHD-2 


73 


2.5 


AHP-3 


80 


2.8 


APZ-4 


86 


3.0 



[0398] Since the lysine-producing mutant B-6 strain which has been bred based on the random mutation and selection 
shows a productivity of less than 2. 1 g/l/h, the APZ-4 strain showing a high productivity of 3.0 g/l/h is useful in industry. 

(7) Lysine fermentation by APZ-4 strain at high temperature 

[0399] The APZ-4 strain, which had been reconstructed by introducing 4 effective mutations into the wild type strain, 
was subjected to the culturing test in a 5 I jar fermenter in the same manner as in Example 2(3) , except that the cultunng 
temperature was changed to 40°C. 
[0400] The results are shown in Table 4. 

Table 4 



Temperature (°C) 


L-Lysine hydrochloride (g/l) 


Productivity (g/l/h) 


32 
40 


86 
95 


3.0 
3.3 



[0401] As is apparent from the results shown in Table 4, the lysine hydrochloride titer and productivity in cultunng at 
a high temperature of 40°C comparable to those at 32°C were obtained. In the mutated and bred lysine-producing B- 
6 strain constructed by repeating random mutation and selection, the growth and the lysine productivity are lowered 
at temperatures exceeding 34'C so that lysine fermentation cannot be carried out, whereas lysine fermentation can 
be carried out using the APZ-4 strain at a high temperature of 40«C so that the load of cooling is greatly reduced and 
it is industrially useful. The lysine fermentation at high temperatures can be achieved by reflecting the high temperature 
adaptability inherently possessed by the wild type strain on the APZ-4 strain. 

[0402] As demonstrated in the reconstruction of the lysine-producing strain, the present invention provides a novel 
breeding method effective for eliminating the problems in the conventional mutants and acquiring industrially advan- 
tageous strains This methodology which reconstitutes the production strain by reconstituting the effective mutation is 
an approach which is efficiently carried out using the nucleotide sequence information of the genome disclosed in the 
present invention, and its effectiveness was found for the first time in the present invention. 

Example 4 

Production of DNA microarray and use thereof 

[0403] A DNA microarray was produced based on the nucleotide sequence infoimation of the ORF deduced from 
the full nucleotide sequences of Corynebacterium glutamicum ATCC 1 3032 using software, and genes of which ex- 
pression is fluctuated depending on the carbon source during culturing were searched. 

(1) Production of DNA microarray 

[0404] Chromosomal DNA was prepared from Corynebacterium glutamicum ATCC 1 3032 by the method of Saito et 



228 

BNSDOCID: <EP 1108790A2J_> 



EP 1 108 790 A2 



15 



20 



25 



30 



35 



40 



45 



50 



55 



al. ( Biochem. Biophys. Acta, 72. 619 (1963)). Based on 24 genes having the nucleotide sequences represented by 
SEQ ID NOS:207, 3433, 281, 3435, 3439. 765, 3445. 1226, 1229, 3448, 3451. 3453, 3455, 1743 3470 2132 3476 
3477, 3485.. 3488, 3489 s 3494, 3496, and 3497 from the ORFs shown in Table 1 deduced from the full genome nucle- 
otide sequence of Corynebacterium glutamicum ATCC 13032 using software and the nucleotide sequence of rabbit 
5 globin gene (GenBank Accession No. V00882) used as an internal standard, oligo DNA primers for PCR amplification 
represented by SEQ ID NOS:7010 to 7059 targeting the nucleotide sequences of the genes were synthesized in a 
usual manner. 

[0405] As the oligo DNA primers used for the PCR, 

[0406] DNAs having the nucleotide sequence represented by SEQ ID NOS:7010 and 7011 were used for the ampli- 
10 fication of the DNA having the nucleotide sequence represented by SEQ ID NO:207, 

[0407] DNAs having the nucleotide sequence represented by SEQ ID NOS:7012 and 7013 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3433, 

[0408] DNAs having the nucleotide sequence represented by SEQ ID NOS:7014 and 7015 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:281 , 

[0409] DNAs having the nucleotide sequence represented by SEQ ID NOS:7016 and 7017 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3435, 

[0410] DNAs having the nucleotide sequence represented by SEQ ID NOS:7018 and 701 9 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3439, 

[0411] DNAs having the nucleotide sequence represented by SEQ ID NOS:7020 and 7021 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:765, 

[0412] DNAs having the nucleotide sequence represented by SEQ ID NOS:7022 and 7023 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3445, 

[0413] DNAs having the nucleotide sequence represented by SEQ ID NOS:7024 and 7025 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1226, 

[0414] DNAs having the nucleotide sequence represented by SEQ ID NOS:7026 and 7027 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO: 1229, 

[0415] DNAs having the nucleotide sequence represented by SEQ ID NOS:7028 and 7029 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3448, 

[0416] DNAs having the nucleotide sequence represented by SEQ ID NOS:7030 and 7031 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3451, 

[0417] DNAs having the nucleotide sequence represented by SEQ ID NOS:7032 and 7033 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3453, 

[0418] DNAs having the nucleotide sequence represented by SEQ ID NOS:7034 and 7035 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3455, 

[0419] DNAs having the nucleotide sequence represented by SEQ ID NOS:7036 and 7037 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1 743, 

[0420] DNAs having the nucleotide sequence represented by SEQ ID NOS:7038 and 7039 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3470, 

[0421] DNAs having the nucleotide sequence represented by SEQ ID NOS:7040 and 7041 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:2132 } 

[0422] DNAs having the nucleotide sequence represented by SEQ ID NOS:7042 and 7043 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3476, 

[0423] DNAs having the nucleotide sequence represented by SEQ ID NOS:7044 and 7045 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3477, 

[0424] DNAs having the nucleotide sequence represented by SEQ ID NOS:7046 and 7047 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3485, 

[0425] DNAs having the nucleotide sequence represented by SEQ ID NOS:7048 and 7049 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3488, 

[0426] DNAs having the nucleotide sequence represented by SEQ ID NOS:7050 and 7051 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3489, 

[0427] DNAs having the nucleotide sequence represented by SEQ ID NOS:7052 and 7053 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3494, 

[0428] DNAs having the nucleotide sequence represented by SEQ ID NOS:7054 and 7055 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3496, 

[0429] DNAs having the nucleotide sequence represented by SEQ ID NOS:7056 and 7057 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3497, and 

[0430] DNAs having the nucleotide sequence represented by SEQ ID NOS:7058 and 7059 were used for the am- 
plification of the DNA having the nucleotide sequence of the rabbit globin gene, 
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as the respective primer set. . . QO „ 

r0431l The PCR was carried for 30 cycles with each cycle consisting ot 1 5 seconds at 95°C and 3 minutes at 68 c, 
using a thermal cycler (GeneAmp PCR system 9600, manufactured by Perkin Elmer), TaKaRa EX-Taq (manufactured 
bv Takara Shuzo), 1 00 ng of the chromosomal DNA and the buffer attached to the TaKaRa Ex-Taq reagent. In the case 
of the rabbit globin gene, a single-stranded cDNA which had been synthesized from rabbit globin mRNA manu ac ured 
bv Life Technologies) according to the manufacture's instructions using a reverse transcriptase RAV-2 (manufactured 
bv Takara Shuzo) The PCR product of each gene thus amplified was subjected to agarose gel electrophoresis and 
extracted and purified using QIAguick Gel Extraction Kit (manufactured by QIAGEN). The purified PGR product ^s 
concentrated by precipitating it with ethanol and adjusted to a concentration of 200 ng/pl. Each PCR produc was 
spotted on a slide glass plate (manufactured by Matsunami Glass) having MAS coating in 2 runs using GTMASS 
SYSTEM (manufactured by Nippon Laser & Electronics Lab.) according to the manufacture's instructions. 

(2) Synthesis of fluorescence labeled cDNA 

r04321 The ATCC 1 3032 strain was spread on BY agar medium (medium prepared by adding 20 g of peptone (man- 
ufactured by Kyokuto Pharmaceutical), 5 g of yeast extract (manufactured by Difco), and 1 6 g of Bactoagar (manufac- 
tured by Difco) to in 1 liter of water and adjusting its pH to 7.2) and cultured at 30'C for 2 days. Then, he cultured 
strain was further inoculated into 5 ml of BY liquid medium and cultured at 30"C overnight. Then, the cultured strain 
was further inoculated into 30 ml of a minimum medium (medium prepared by adding 5 g o( ammon.um sulfate, 5 g of 
urea 0 5 g of monopotassium dihydrogenphosphate, 0.5 g of dipotassium monohydrogenphosphate 20.9 g o mor- 
pholinopropanesulfonic acid, 0.25 g of magnesium sulfate heptahydrate, 10 mg of calcium chloride dihydrate 1 0 mg 
of manganese sulfate monohydrate, 10 mg of ferrous sulfate heptahydrate, 1 mg of zinc sulfate heptahydrate, 0.2 mg 
copper sulfate, and 0.2 mg biotin to 1 liter of water, and adjusting its pH to 6.5) containing 110 mmol/l glucose or 200 
mmol/l ammonium acetate, and cultured in an Erlenmyer flask at 30» to give 1 .0 of absorbance at 660 nm After the 
cells were prepared by centrifuging at 4«C and 5,000 rpm for 1 0 minutes, total RNA was prepared from the resulting 
cells according to the method of Bormann et al. ( Molecular Microbiology, 6: 317-326 (1992)). To avoid <=°™a*on 
with DNA the RNA was treated with Dnasel (manufactured by Takara Shuzo) at 37»C for 30 minutes and then further 
purified using Qiagen RNeasy MiniKit (manufactured by QIAGEN) according to the manufacture's instructions. To 30 
uq of the resulting total RNA, 0.6 pi of rabbit globin mRNA (50 ng/pl, manufactured by Life Technologies) and 1 pi of 
a random 6 mer primer (500 ng/pl, manufactured by Takara Shuzo) were added for denaturing at 65 C for 0 minutes, 
followed by quenching on ice. To the resulting solution, 6 pi of a buffer attached to Superscript II manufactured by 
Lifetechnologies), 3 pi of 0.1 mol/l DTT, 1 .5 pi of dNTPs (25 mmol/l dATP, 25 mmol/l dCTP, 25 mmol/l dGTP, 10 mmoV 
I dTTP) 1 5 ul of Cy5-dUTP or Cy3-dUTP (manufactured by NEN) and 2 pi of Superscript II were added, and allowed 
to stand at 25°C for 10 minutes and then at 42°C for 110 minutes. The RNA extracted from the cells using glucose as 
the carbon source and the RNA extracted from the cells using ammonium acetate were labeled with Cy5-dUTP and 
Cy3-dUTP respectively. After the fluorescence labeling reaction, the RNA was digested by adding 1.5 pi of 1 mom 
sodium hydroxide-20 mmol/l EDTA solution and 3.0 pi of 10% SDS solution, and allowed to stand at 65 C for 10 
minutes. The two cDNA solutions after the labeling were mixed and purified using Qiagen PCR punfication Kit (man- 
ufactured by QIAGEN) according to the manufacture's instructions to give a volume of 10 pi. 

(3) Hybridization 

r04331 UltraHyb (110 pi) (manufactured by Ambion) and the fluorescence-labeled cDNA solution (10 pi) were mixed 
and subjected to hybridization and the subsequent washing of slide glass using GeneTAC Hybridization Station (man- 
ufactured by Genomic Solutions) according to the manufacture's instructions. The hybridization was earned out at 
50°C, and the washing was carried out at 25°C. 

(4) Fluorescence analysis 

[0434] The fluorescence amount of each DNA array having the fluorescent cDNA hybridized therewith was measured 
using ScanArray 4000 (manufactured by GSI Lumonics). (l . . . 

[0435] Table 5 shows the Cy3 and Cy5 signal intensities of the genes having been corrected on the basis of the data 
of the rabbit globin used as the internal standard and the Cy3/Cy5 ratios. 

Table 5 



SEQ ID NO 


Cy3 intensity 


Cy5 intensity 


Cy3/Cy5 


207 


5248 


3240 


1.62 
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Table 5 (continued) 



SEQ ID NO 


Cy3 intensity 


Cy5 intensity 


Cy3/Cy5 


3433 


2239 


2694 


0.83 


281 


2370 


2595 


0.91 


3435 


2566 


2515 


1.02 


3439 


5597 


6944 


0.81 


765 


6134 


4943 


1.24 


3455 


1169 


1284 


0.91 


1226 


1301 


1493 


0.87 


1229 


1168 


1131 


1.03 


3448 


1187 


1594 


0.74 


3451 


2845 


3859 


0.74 


3453 


3498 


1705 


2.05 


3455 


1491 


1144 


1.30 


1743 


1972 


1841 


1.07 


3470 


4752 


3764 


1.26 


2132 


1173 


1085 


1.08 


3476 


1847 


1420 


1.30 






1 164 


1.10 


3485 


4539 


8014 


0.57 


3488 


34289 


1398 


24.52 


3489 


43645 


1497 


29.16 


3494 


3199 j 


2503 


1.28 


3496 


3428 


2364 


1.45 


3497 


3848 


3358 


1.15 



[0436] The ORF function data estimated by using software were searched for SEQ ID NOS:3488 and 3489 showing 
remarkably strong Cy3 signals. As a result, it was found that SEQ ID NOS:3488 and 3489 are a maleate synthase 
gene and an isocitrate lyase gene, respectively. It is known that these genes are transcriptionally induced by acetic 
acid in Corynobacterium glutamicum (Archives of Microbiology, 168: 262-269 (1 997)). 

[0437] As described above, a gene of which expression is fluctuates could be discovered by synthesizing appropriate 
ohgo DNA primers based on the ORF nucleotide sequence information deduced from the full genomic nucleotide 
sequence information of Corynebactehum glutamicum ATCC 1 3032 using software, amplifying the nucleotide sequenc- 
es of the gene using the genome DNA of Corynebactehum glutamicum as a template in the PCR reaction, and thus 
producing and using a DNA microarray. 

[0438] This Example shows that the expression amount can be analyzed using a DNA microarray in the 24 genes 
On the other hand, the present DNA microarray techniques make it possible to prepare DNA microarrays having thereon 
several thousand gene probes at once. Accordingly, it is also possible to prepare DNA microarrays having thereon all 
of the ORF gene probes deduced from the full genomic nucleotide sequence of Corynebactehum glutamicum ATCC 
13032 determined by the present invention, and analyze the expression profile at the total gene level of Corynebac- 
tehum glutamicum using these arrays. 

Example 5 

Homology search using Corynebactehum glutamicum genome sequence 
(1) Search of adenosine deaminase 

[0439] The amino acid sequence (ADD_ECOLI) of Escherichia co// adenosine deaminase was obtained from Swiss- 
prot Database as the amino acid sequence of the protein of which function had been confirmed as adenosine deaminase 
(EC3.5.4.4). By using the full length of this amino acid sequence as a query, a homology search was carried out on a 
nucleotide sequence database of the genome sequence of Corynebactehum glutamicum or a database of the amino 
acids in the ORF region deduced from the genome sequence using FASTA program (Proc. NatL Acad Sci ISA 85 
2444-2448 (1 988)). A case where E-value was le ™ or less was judged as being significantly homologous. As a result 
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no sequence significantly homologous with the Escherichia coli adenosine deaminase was found in the nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or the database of the ammo acid se- 
quences in the ORF region deduced from the genome sequence. Based on these results it is assumed tha Co^ne- 
bacterium glutamicum contains no ORF having adenosine deaminase activity and thus has no activity of converting 
adenosine into inosine. 

(2) Search of glycine cleavage enzyme 

r04401 The sequences (GCSP_ECOLI, GCST_ECOLI and GCSH_ECOLI) of glycine decarboxylase, aminomethyl 
transferase and an aminometh y r g roup carrier each of which is a component of Escherichia coli glyc.ne cleavage 
enzyme as the amino acid sequence of the protein, of which function had been confirmed as glyc.ne cleavage enzyme 
(EC2 12 10), were obtained from Swiss-prot Database. 

0441] By using these full-length amino acid sequences as a query, a homology search was carried outon a nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF amino acid 
sequences deduced from the genome sequence using FASTA program. A case where E-value was le <° o less ^was 
judged as being significantly homologous. As a result, no sequence significantly homologous with the g yc.n ^car- 
boxylase the aminomethyl transferase or the aminomethyl group carrier each of which is a component of Eschench.a 
co// glycine cleavage enzyme, was found in the nucleotide sequence database of the genome sequence of Coryne- 
bacterium glutamicum or the database of the ORF amino acid sequences estimated from the genome sequence. Based 
on these results, it is assumed that Corynebacterium glutamicum contains no ORF having the act.vrty of glyc.ne de- 
carboxylase, aminomethyl transferase or the aminomethyl group carrier and thus has no activity of the glyc.ne cleavage 
enzyme. 

(3) Search of IMP dehydrogenase 

r04421 The amino acid sequence (IMDH ECOLI) of Escherichia coli IMP dehydrogenase as the amino acid sequence 
of the protein, of which function had been confirmed as IMP dehydrogenase (EC1 .1 .1 .205), was obtained from Sw.ss- 
prot Database. By using the full length of this amino acid sequence as a query, a homology search was earned out on 
a nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF 
amino acid sequences predicted from the genome sequence using FASTA program. A case where E-yalue was le 1 
or less was judged as being significantly homologous. As a result, the amino acid sequences encoded by two ORFs, 
namefy an ORF positioned in the region of the nucleotide sequence No. 61 5336 to 616853 (or ORF having the nucle- 
otide sequence represented by SEQ ID NO:672) and another ORF positioned in the region of the nucleot.de sequence 
No 616973 to 618094 (or ORF having the nucleotide sequence represented by SEQ ID NO:674) were significantly 
35 homologous with the ORFs of Escherichia coli IMP dehydrogenase. By using the above-described ' P r f dlc ^ ed al ^ n ° 
acid sequence as a query in order to examine the similarity of the amino acid sequences encoded by the ORFs with 
IMP dehydrogenases of other organisms in greater detail, a search was carried out on GenBank (httpiZ/ww^ncbinlm 
nih gov/) nr-aa database (amino acid sequence database constructed on the basis of GenBankCDS trans at.on prod- 
ucts PDB database, Swiss-Prot database, PIR database, PRF database by eliminating duplicated reg.s rajons) using 
40 BLAST program As a result, both of the two amino acid sequences showed significant homologies with IMP dehdy- 
rogenasesof other organisms and clearly higher homologies with IMPdehdyrogenases than with amino acid sequences 
of other proteins and thus, it was assumed that the two ORFs would function as IMP dehydrogenase. Based on these 
results, it was therefore assumed that Corynebacterium glutamicum has two ORFs having the IMP dehydrogenase 
activity. 

45 

Example 6 

Proteome analysis of proteins derived from Corynebacterium glutamicum 

50 (1) Preparations of proteins derivedfrom Corynebacterium glutamicum ATCC 13032, FERM BP-7134 and FERM BP- 
158 

r04431 Culturing tests of Corynebacterium glutamicum ATCC 13032 (wild type strain), Corynebacterium glutamicum 
FERM BP-7134 (lysine-producing strain) and Corynebacterium glutamicum (FERM BP-158, lysine-highly producing 
55 strain) were carried out in a 5 I jarfermenter according to the method in Example 2(3). The results are shown in Table 6. 
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Strain 


L-Lysine yield (g/l) 


ATCC 13032 


0 


FERM BP-7134 


45 


FERM BP-158 


60 



[0444] After culturing, cells of each strain were recovered by centrif Ligation. These cells were washed with Tris-HCI 
buffer (1 0 mmol/l Tris-HCI, pH 6.5, 1 .6 mg/ml protease inhibitor (COMPLETE; manufactured by Boehringer Mannheim)) 
three times to give washed cells which could be stored under freezing at -80°C. The freeze-stored cells were thawed 
before use, and used as washed cells. 

[0445] The washed cells described above were suspended in a disruption buffer (1 0 mmol/l Tris-HCI, pH 7 4 5 mmol/ 
I magnesium chloride, 50 mg/l RNase, 1.6 mg/ml protease inhibitor (COMPLETE: manufactured by Boehringer Man- 
nheim)), and disrupted with a disruptor (manufactured by Brown) under cooling. To the resulting disruption solution 
DNase was added to give a concentration of 50 mg/l, and allowed to stand on ice for 10 minutes. The solution was 
centnfuged (5,000 x g, 15 minutes, 4*C) to remove the undisrupted cells as the precipitate, and the supernatant was 
recovered. 

[0446] To the supernatant, urea was added to give a concentration of 9 mol/l. and an equivalent amount of a lysis 
buffer (9.5 mol/l urea, 2% NP-40, 2% Ampholine, 5% mercaploelhanol. 1.6 mg/ml protease inhibilor (COMPLETE- 
manufactured by Boehringer Mannheim) was added thereto, followed by thoroughly stirring at room temperature for 
dissolving. 

[0447] After being dissolved, the solution was centnfuged at 12.000 x g for 15 minutes, and the supernatant was 
recovered. 

25 [0448] To the supernatant, ammonium sulfate was added to the extent of 80% saturation, followed by thoroughly 
stirring for dissolving. 

[0449] After being dissolved, the solution was centrifuged (1 6,000 x g, 20 minutes, 4*C), and the precipitate was 
recovered. This precipitate was dissolved in the lysis buffer again and used in the subsequent procedures as a protein 
sample. The protein concentration of this sample was determined by the method for quantifying protein of Bradford. 

(2) Separation of protein by two dimensional electrophoresis 

[0450] The first dimensional electrophoresis was carried out as described below by the isoelectric electrophoresis 
method. 

[0451] A molded dry IPG strip gel (pH 4-7, 13 cm, Immobile DryStrips; manufactured by Amersham Pharmacia 
Biotech) was set in an electrophoretic apparatus (Multiphor II or IPGphor; manufactured by Amersham Pharmacia 
Biotech) and a swelling solution (8 mol/l urea, 0.5% Triton X-100, 0.6% dithiothreitol, 0.5% Ampholine, pH 3-10) was 
packed therein, and the gel was allowed to stand for swelling 1 2 to 1 6 hours. 

[0452] The protein sample prepared above was dissolved in a sample solution (9 mol/l urea, 2% CHAPS 1% dithi- 
othreitol. 2% Ampholine, pH 3-1 0), and then about 1 00 to 500 ucj (in terms of protein) portions thereof were taken and 
added to the swollen IPG strip gel. 

[0453] The electrophoresis was carried out in the 4 steps as defined below under controlling the temperature to 20°C: 

step 1 : 1 hour under a gradient mode of 0 to 500V; 
step 2: 1 hour under a gradient mode of 500 to 1 ,000 V; 
step 3: 4 hours under a gradient mode of 1 ,000 to 8,000 V; and 
step 4: 1 hour at a constant voltage of 8,000 V. 

[0454] After the isoelectric electrophoresis, the I PG strip gel was put off from the holder and soaked in an equilibration 
buffer A (50 mmol/l Tris-HCI, pH 6.8, 30% glycerol. 1% SDS, 0.25% dithiothreitol) for 15 minutes and another equili- 
bration buffer B (50 mmol/l Tris-HCI, pH 6.8, 6 mol/l urea, 30% glycerol, 1 % SDS, 0.45% iodo acetamide) for 1 5 minutes 
to sufficiently equilibrate the gel. 

[0455] After the equilibrium, the IPG strip gel was lightly rinsed in an SDS electrophoresis buffer (1 .4% glycine, 0.1% 
SDS, 0.3% Tris-HCI, pH 8.5), and the second dimensional electrophoresis depending on molecular weight was carried 
out as described below to separate the proteins. 

[0456] Specifically, the above IPG strip gel was closely placed on 14% polyacrylamide slub gel (14% poly aery I amide 
0.37% bisacrylamide, 37.5 mmol/l Tris-HCI, pH 8.8, 0.1% SDS, 0.1% TEMED, 0.1% ammonium persulfate) and sub- 
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jected to electrophoresis under a constant voltage of 30 mA at 20°C for 3 hours to separate the proleins. 

(3) Detection of protein spot 

5 [0457] Coomassie staining was performed by the method of Gorg et al. (Electrophoresis, 9: 531-546 (1988)) for the 
slub gel after the second dimensional electrophoresis. Specifically, the slub gel was stained under shaking at 25 C for 
about 3 hours, the excessive coloration was removed with a decoloring solution, and the gel was thoroughly washed 

raissr'Th^esults are shown in Fig. 2. The proteins derived from the ATCC 13032 strain (Fig. 2A), FERM BP-7134 
10 strain (Fig. 2B) and FERM BP-1 58 strain (Fig. 2C) could be separated and detected as spots. 

(4) In-gel digestion of detected protein spot 

r0459] The detected spots were each cut out from the gel and transferred into siliconized tube, and 400 pi of 100 
15 mmol/1 ammonium bicarbonate : acetonitrile solution (1 :1 , v/v) was added thereto, followed by shaking overnight and 
freeze-dried as such. To the dried gel, 1 0 pi of a lysylendopeptidase (LysC) solution (manufactured by WAKO, prepared 
with 0 1 % SDS-containing 50 mmol/l ammonium bicarbonate to give a concentration of 1 00 ng/pl) was added and the 
gel was allowed to stand for swelling at 0°C for 45 minutes, and then allowed to stand at 37°C for 16 hours. After 
removing the LysC solution, 20 pi of an extracting solution (a mixture of 60% acetonitrile and 5% formic acid) was 
20 added followed by ultrasonication at room temperature for 5 minutes to disrupt the gel. After the disruption, the extract 
was recovered by centrifugation (12,000 rpm, 5 minutes, room temperature). This operation was repeated twice to 
recover the whole extract. The recovered extract was concentrated by centrifugation in vacuoXo halve the liquid volume. 
To the concentrate, 20 pi of 0.1% trifluoroacetic acid was added, followed by thoroughly stirring, and the mixture was 
subjected to desalting using ZipTip (manufactured by Millipore). The protein absorbed on the earners of ZipT.p was 
25 eluted with 5 pi of a-cyano-4-hydroxycinnamic acid for use as a sample solution for analysis. 

(5) Mass spectrometry and amino acid sequence analysis of protein spot with matrix assisted laser desorption ionization 
time of flight mass spectrometer (MALDI-TOFMS) 

30 [04601 The sample solution for analysis was mixed in the equivalent amount with a solution of a peptide mixture for 
mass calibration (300 nmol/l Angiotensin II, 300 nmol/l Neurotensin, 150 nmol/l ACTHclip 18-39, 2.3 pmol/l bovme 
insulin B chain), and 1 pi of the obtained solution was spotted on a stainless probe and crystallized by spontaneously 

[0461] As measurement instruments, REFLEX MALDI-TOF mass spectrometer (manufactured by Bruker) and an 
35 N2 laser (337 nm) were used in combination . 

[0462] The analysis by PMF (peptide-mass finger printing) was carried out using integration spectra data obtained 
by measuring 30 times at an accelerated voltage of 19.0 kV and a detector voltage of 1 .50 kV under reflector mode 
conditions. Mass calibration was carried out by the internal standard method. 

[0463] The PSD (post-source decay) analysis was carried out using integration spectra obtained by successively 
40 altering the reflection voltage and the detector voltage at an accelerated voltage of 27.5 kV. 

[0464] The masses and amino acid sequences of the peptide fragments derived from the protein spot after digestion 
were thus determined. 

(6) Identification of protein spot 

[0465] From the amino acid sequence information of the digested peptide fragments derived from the protein spot 
obtained in the above (5), ORFs corresponding to the protein were searched on the genome sequence database of 
Corynebacterium glutamicum ATCC 13032 as constructed in Example 1 to identify the protein. 
[0466] The identification of the protein was carried out using MS-Fit program and MS-Tag program of intranet protein 
so prospector. 

(a) Search and identification of gene encoding high-expression protein 

[0467] In the proteins derived from Corynebacterium glutamicum ATCC 1 3032 showing high expression amounts in 
55 CBB-staining shown in Fig. 2A, the proteins corresponding to Spots-1 , 2, 3, 4 and 5 were identified by the above method^ 
[0468] As a result it was found that Spot-1 corresponded to enolase which was a protein having the ammo ac.d 
sequence of SEQ ID NO:4585; Spot-2 corresponded to phosphoglycelate kinase which was a protein having the ammo 
acid sequence of SEQ ID NO:5254; Spot-3 corresponded to glyceraldehyde-3-phosphate dehydrogenase which was 
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a protein having the amino acid sequence represented by SEO ID NO:5255; Spot-4 corresponded to fructose bis- 
phosphate aldolase which was a protein having the amino acid sequence represented by SEQ ID NO:6543; and Spot- 
5 corresponded to those phosphate isomerase which was a protein having the amino acid sequence represented bv 
SEQ ID NO:5252. 

5 [0469] These genes, represented by SEQ ID NOS:1085, 1754, 1 775, 3043 and 1752 encoding the proteins corre- 
sponding to Spots-1, 2, 3, 4 and 5, respectively, encoding the known proteins are important in the central metabolic 
pathway for maintaining the life of the microorganism. Particularly, it is suggested that the genes of Spots-2, 3 and 5 
form an operon and a high-expression promoter is encoded in the upstream thereof (J. of Eacteriol 174 6067-6086 
(1992)). 

10 [0470] Also, the protein corresponding to Spot-9 in Fig. 2 was identified in the same manner as described above, 
and it was found that Spot-9 was an elongation factor Tu which was a protein having the amino acid sequence repre- 
sented by SEQ ID No:6937, and that the protein was encoded by DNA having the nucleotide sequence represented 
bySEQIDNo:3437. 

[0471] Based on these results, the proteins having high expression level were identified by proteome analysis using 
'5 the genome sequence database of Corynebacterium glutamicum constructed in Example 1 . Thus, the nucleotide se- 
quences of the genes encoding the proteins and the nucleotide sequences upstream thereof could be searched simul- 
taneously. Accordingly, it is shown that nucleotide sequences having a function as a high-expression promoter can be 
efficiently selected. 

20 (b) Search and identification of modified protein 

[0472] Among the proteins derived from Corynebacterium glutamicum FERM BP-7134 shown in Fig. 2B S Spots-6, 
7 and 8 were identified by the above method. As a result, these three spots all corresponded to catalase which was a 
protein having the amino acid sequence represented by SEQ ID NO:3785. 

[0473] Accordingly, all of Spots-6, 7 and 8 detected as spots differing in isoelectric mobility were all products derived 
from a catalase gene having the nucleotide sequence represented by SEQ ID No:285. Accordingly, it is shown that 
the catalase derived from Corynebacterium glutamicum FERM BP-7134 was modified after the translation. 
[0474] Based on these results, it is confirmed that various modified proteins can be efficiently searched by proteome 
analysis using the genome sequence database of Corynebacterium glutamicum constructed in Example 1 . 
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(c) Search and identification of expressed protein effective in lysine production 

[0475] It was found out that in Fig. 2A (ATCC 13032: wild type strain), Fig. 2B (FERM BP-7134: lysine-producing 
strain) and Fig. 2C (FERM BP-158: lysine-highly producing strain), the catalase corresponding to Spot-8 and the elon- 
gation factor Tu corresponding to Spot-9 as identified above showed the higher expression level with an increase in 
the lysine productivity. 

[0476] Based on these results, it was found that hopeful mutated proteins can be efficiently searched and identified 
in breeding aiming at strengthening the productivity of a target product by the proteome analysis using the genome 
sequence database of Corynebacterium glutamicum constructed in Example 1 . 

[0477] Moreover, useful mutation points of useful mutants can be easily specified by searching the nucleotide se- 
quences (nucleotide sequences of promoter, ORF, or the like) relating to the identified proteins using the above data- 
base and using primers designed on the basis of the sequences. As a result of the fact that the mutation points are 
specified, industrially useful mutants which have the useful mutations or other useful mutations derived therefrom can 
be easily bred. 

[0478] While the invention has been described in detail and with reference to specific embodiments thereof, it will 
be apparent to one of skill in the art that various changes and modifications can be made therein without departing 
from the spirit and scope thereof. All references cited herein are incorporated in their entirety. 

50 Claims 

1 . A method for at least one of the following: 
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(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium, 

(B) measuring an expression amount of a gene derived from a coryneform bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 

(D) analyzing expression patterns of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a coryneform bacterium, 
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said method comprising: 

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected 
from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any 

5 one of SEQ ID NOS:1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 

stringent conditions, and third polynucleotides comprising a sequence of 1 0 to 200 continuous bases of 
the first or second polynucleotides, 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 

10 labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 

2. The method according to claim 1 , wherein the coryneform bacterium is a microorganism belonging to the genus 
15 Corynebacterium, the genus Brevibactehum, or the genus Microbacterium. 

3 The method according to claim 2, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium gtutamicum, Corynebacterium acetoacidophiium, Corynebacterium 
acelogiutamicum, Corynebacterium caiiunae, Corynebacterium herculis, Corynebacterium lilium t Corynebacten- 

20 urn melassecota, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

4 The method according to claim 1 , wherein the polynucleotide derived from a coryneform bacterium, the polynuce- 
iotide derived from a mutant of the coryneform bacterium or the polynucleotide to be examined is a gene relating 
to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 

25 an organic acid, and analogues thereof. 

5. The method according to claim 1 , wherein the polynucleotide to be examined is derived from Escherichia coli. 



6. A polynucleotide array, comprising: 
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at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucle- 
otide sequence represented by any one of SEQ ID NOS:1 to 3501 , second polynucleotides which hybridize 
with the first polynucleotides under stringent conditions, and third polynucleotides comprising 10 to 200 con- 
tinuous bases of the first or second polynucleotides, and 
35 a solid support adhered thereto. 

7. A polynucleotide comprising the nucleotide sequence represented by SEQ ID NO:1 or a polynucleotide having a 
homology of at least 80% with the polynucleotide. 

40 8. A polynucleotide comprising any one of the nucleotide sequences represented by SEQ ID NOS:2 to 3431 , or a 
polynucleotide which hybridizes with the polynucleotide under stringent conditions. 

9. A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID 
NOS:3502 to 6931 , or a polynucleotide which hybridizes therewith under stringent conditions. 



45 



1 0 A polynucleotide which is present in the 5' upstream or 3' downstream of a polynucleotide comprising the nucleotide 
sequence of any one of SEQ ID NOS:2 to 3431 in a whole polynucleotide comprising the nucleotide sequence 
represented by SEQ ID NO:1 , and has an activity of regulating an expression of the polynucleotide. 

so 1 1 A polynucleotide comprising 1 0 to 200 continuous bases in the nucleotide sequence of the polynucleotide of any 
one of claims 7 to 1 0, or a polynucleotide comprising a nucleotide sequence complementary to the polynucleotide 
comprising 10 to 200 continuous based. 
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12. A recombinant DNA comprising the polynucleotide of any one of claims 8 to 11 

13. A transformant comprising the polynucleotide of any one of claims 8 to 11 or the recombinant DNA of claim 12. 

14. A method for producing a polypeptide, comprising: 
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culturing the transformant of claim 13 in a medium to produce and accumulate a polypeptide encoded by the 
polynucleotide of claim 8 or 9 in the medium, and 
recovering the polypeptide from the medium. 

1 5. A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof, comprising: 

culturing the transformant of claim 13 in a medium to produce and accumulate at least one of an amino acid, 
a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and 
recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, 
and analogues thereof from the medium. 

16. A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS 2 to 
3431. 

17. A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931 . 

18. The polypeptide according to claim 16 or 17, wherein at least one amino acid is deleted, replaced, inserted or 
added, said polypeptides having an activity which is substantially the same as that of the polypeptide without said 
at least one amino acid deletion, replacement, insertion or addition. 

1 9. A polypeptide comprising an amino acid seq uence having a homology of at least 60% with the amino acid sequence 
of the polypeptide of claim 1 6 or 1 7, and having an activity which is substantially the same as that of the polypeptide. 

20. An antibody which recognizes the polypeptide of any one of claims 16 to 19. 

21. A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of claims 1 6 to 1 9 and 
partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

22. A polypeptide array, comprising: 

at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypep- 
tides of claims 1 6 to 1 9 and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

23. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
40 form bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:1 
to 3501 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator 
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24. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, target se- 
quence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with 
the target sequence or target structure motif information; and 
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(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

25. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage 
device for screening and analyzing amino acid sequence information which is coincident with or analogous to 
the target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

26. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , and target 
20 sequence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target sequence or target structure motif information; and 

(iv) screening and analyzing amino acid sequence information which is coincident with or analogous to the 
25 target sequence or target structure motif information. 

27. A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having a 
target nucleotide sequence derived from a coryneform bacterium, comprising the following: 
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(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:2 
to 3501, function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide 
sequence information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
35 2 to 3501 with the target nucleotide sequence information for determining a function of a polypeptide encoded 

by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501 ; and 

(iv) an output devices that shows a function obtained by the comparator. 

40 28 A method based on a computer for determining a function of a polypeptide encoded by a polypeptide encoded by 
a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the fol- 
lowing: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 , function in- 
45 formation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 with 
the target nucleotide sequence information; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
50 which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 

from SEQ ID NOS:2 to 3501 . 

29. A system based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

55 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target amino acid sequence infor- 
mation; 
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(ii) a data storing device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 
one amino acid sequence selected from SEQ ID NOS:3502 to 7001 ; and 

(iv) an output device that shows a function obtained by the comparator. 

30. A method based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 



(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , function 
information based on the amino acid sequence, and target amino acid sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
15 with the target amino acid sequence information: and 

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS 3502 to 
7001. 



31 . The system according to any one of claims 23, 25, 27 and 29, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

32. The method according to any one of claims 24, 26, 28 and 30, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

33. The system according to claim 31 , wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogiutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacteri- 
um meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

34. The method according to claim 32, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogiutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacteri- 
um meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

35. A recording medium or storage device which is readable by a computer in which at least one nucleotide sequence 
information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide sequence is 
recorded, and is usable in the system of claim 23 or 27 or the method of claim 24 or 28. 

40 36. A recording medium or storage device which is readable by a computer in which at least one amino acid sequence 
information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence 
is recorded, and is usable in the system of claim 25 or 29 or the method of claim 26 or 30. 
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37. The recording medium or storage device according to claim 35 or 36, which is a computer readable recording 
medium selected from the group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory 
(RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW, DVD-ROM DVD-RAM 
and DVD-RW. 

38. A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Val 
residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium 
is replaced with an amino acid residue other than a Val residue. 

39. A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid 
sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

40. The polypeptide according to claim 38 or 39, wherein the Val residue at the 59th position is replaced with an Ala 
residue. 
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41 A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro residue 
at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform bacterium is 
replaced with an amino acid residue other than a Pro residue. 

42 A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino acid 
sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

43. The polypeptide according to claim 41 or 42, wherein the Pro residue at the 458th position is replaced with a Ser 
residue. 

44. The polypeptide according to any one of claims 38 to 43, which is derived from Corynebacterium glutamicum. 

45. A DNA encoding the polypeptide of any one of claims 38 to 44. 
15 46. A recombinant DNA comprising the DNA of claim 45. 

47. A transformant comprising the recombinant DNA of claim 46. 

48. A transformant comprising in its chromosome the DNA of claim 45. 

20 

49. The transformant according to claim 47 or 48, which is derived from a coryneform bacterium. 

50. The transformant according to claim 49, which is derived from Corynebacterium glutamicum. 
25 51. A method for producing L-lysine, comprising: 

culturing the transformant of any one of claims 47 to 50 in a medium to produce and accumulate L-lysine in 
the medium, and 

recovering the L-lysine from the culture. 

30 

52. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:1 to 3431 , comprising the following: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
35 rium which has been subjected to mutation breeding so as to produce at least one compound selected from 

an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(Hi) introducing the mutation point into a coryneform bacterium which is free of the mutation point, or deleting 
40 the mutation point from a coryneform bacterium having the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 

53. The method according to claim 52, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
45 a signal transmission pathway. 

54. The method according to claim 52, wherein the mutation point is a mutation point relating to a useful mutation 
which improves or stabilizes the productivity. 

so 55. A method for breading a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:1 to 3431 , comprising: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 

55 an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 

method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and 
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(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 

56. The method according to claim 55, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
5 a signal transmission pathway. 

57. The method according to claim 55, wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

10 58. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:2 to 3431 , comprising the following: 

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 

15 quence information represented by SEQ ID NOS:2 to 3431 ; 

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity; 

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bac- 
terium which have been transformed with the gene obtained in (iii). 

20 

59. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:2 to 3431 , comprising the following: 

(i) arranging a function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431 ; 
25 (ii) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission 

pathway; 

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium 
in combination with information relating known biosynthesis pathway or signal transmission pathway of a co- 
ryneform bacterium; 

30 (iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either 
strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or 
weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 

35 60. A coryneform bacterium, bred by the method of any one of claims 52 to 59. 

61. The coryneform bacterium according to claim 60, which is a microorganism belonging to the genus Corynebacte- 
rium, the genus Brevibactehum, or the genus Microbacterium. 

40 62. The coryneform bacterium according to claim 61 , wherein the microorganism belonging to the genus Corynebac- 
terium is selected from the group consisting of Corynebacterium glutamicum, Corynebactehum acetoacidophilum, 
Corynebacterium acetogtutamicum, Corynebactehum callunae, Corynebacterium herculis, corynebacterium ///- 
ium, Corynebactehum melassecoia, Corynebactehum thermoamino genes, and Corynebacterium ammonia 
genes. 

45 

63. A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of claims 60 to 62 in a medium to produce and accumulate at 
50 least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 

and analogues thereof; 
recovering the compound from the culture. 

64. The method according to claim 63, wherein the compound is L-lysine. 

55 

65. A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the following: 

(i) preparing 
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a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
jected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 

5 

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(iii) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; 

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase 
10 to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

is 66. The method according to claim 65, wherein the coryneform bacterium is a microorganism belonging to the genus 
corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

67. The method according to claim 66, wherein the microorganism belonging to the genus Corynebacteriumls selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophilum, Corynebacterium 

20 acetogiutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium iilium, Corynebacteri- 

um melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

68. A biologically pure culture of Corynebacterium giutamicum AHP-3 (FERM BP-7382) . 
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FIG. 4 
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